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DISTRIBUTION OF THE MEANS DIVIDED BY THE 
STANDARD DEVIATIONS OF SAMPLES FROM 
NON.HOMOGENEOUS POPULATIONS 

By 

G. A. Baker 


In a previous paper^ the distributions of the means and vari- 
ances, means squared and variances of samples of two drawn from 
a non-homogeneous population composed of two normal popula- 
tions' have been discussed. It is the purpose of this paper to dis- 
cuss similarly the distribution of the means of samples of two 
measured from the mean of the population divided by the stand- 
ard deviations of the samples for such parent populations and to 
present experimental results for samples of four. 

CASE n= 2 

Suppose that a population represented by 


( 1 ) 


f(x)-. 


cr,JSn 





1 (X- TVgf 


-N, 777, + Wz -O 

is considered. If ?7-5 individuals come from the first component 
and s from the second in drawing samples of 77 and if rf) is the 
mean of the sample measured from the mean of the population 
and a is the standard deviation of the sample,* then 
( 2 ) 
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2 distribution of the means 

where 777 „ ^and are estimates of the corresponding 

parameters of (1). 

For the case 77 = 2 when both individuals come from the first 
component of (1) it is known that the distribution of the means 
divided by the standard deviations of the samples is proportional 
to 

(3) 

da 


the origin of u. being taken at Similarly, when bot^ in- 

dividuals of the sample come from the second component, is 

cf 

distributed as proportional to 

du/ 


the origin of w being taken at 2^. 

When one individual comes from each component (2) be- 
comes 


(5) 


ff},* Trig ) i 


because no estimate of the standard deviations of the components 
of (1) can be made from one individual, The distribution of fn/ 
is proportional to the first componeiit of (1), and tttj is distributed 
as proportional to the second component. The distributions of m, 
and fWj are independent. 

Expression (5) can be rewritten as 


( 6 ) 


2Z2 

a 


1 - 
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(7) 



If the distribution of v/ is found, the distribution of « - S may 
be found by making the transformation 
( 8 ) 


2 = i- 




or 

(9) 





and 

(10) 


cLv 


_2. 

U-lf 


dx. 


The distribution of v' is a special case of the distribution of an 
index both of whose components follow the normal law. That is, 
we seek the distribution of 



Tiand y being distributed as 
( 11 ) 


Zoe 



(X'X) (y-v) (y-y)^ 

Og CTj* 


This distribution may be obtained as follows. 

Temma I.*»* If two variables * and < oo, -po<y <oo 


* (Loc. cit.) ' 

'Baten, W. D. “Cotnbiningf Constant Probability Functions”— /I meri* 
tm Mcftkunatical Monthly, Oct., 1930. 
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are so related that the probability of an z indx and of a y in <2'yis 
f(x,y)dxdy 

then the probability that v-x-y is mdvk proportional to 


/ W 1 

f(v+y,y)dy d^. 

00 


Consider, first, the portion of (11) in the first quadrant. 


Vs 


y 

1C 


Put 


and take the logarithm of each side, thus 


( 12 ) 


log V = log y- log X . 


Put 


1 = log V 
w s log y ' 
a * log X 

and (12) becomes 

(13) * I*^~u 

where the range of w and a is - oo to -kso . The equation of the 
correlation surface of w and a is proportional to 

(14) 




. . (eWe^.y) , 


~oF 


F(w,w)=c'*e'*''e. 

Hence, F(u,I-t-a)da when the transformation e ‘^sX is made, 

becomes 

(IS) 


x.e e 


i 1 [ Zr(x-g)_ rxg^ 


dx 


where x ranges from O \o‘v>. By the application of Lemma I 
the proportional probability of a value of v in cfvwhen x andy are 
both positive is obtained by integrating (IS) from t? to ew v\rith 
respect to X and making the transformation Thus, 
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(16) 




-L 

^ 1=7^ 


^.Og, 



Qcv-y) 


“ / 




+ ^ K- e 
IT ^ 

where 


(Zv~fy 

za 


a a Z rcr, cf^,^ -*' 


b^(cr,‘^y - ra, dg^ y.)\/+(a^x - rd^a^y) 


is obtained. The distribution of v if both x and y arc negative 
(and htnce \/ positive) is the same as (16) except that the last 
term is reversed in sign. Thus, for y positive the distribution of y 
is proportional to two times the first two terms of (16). If V is 
negative, i.e. x negative and y positive or x positive and y negative, 
(16) is obtained in one case and (16) with the sign of the last 
term changed in the other case. That is, the distribution of v is 
proportional to the first two terms of (16) when v ranges from 
~bo to + oi'. 

In our case r^O, m,~x, Hence, the distribution 


of y becomes proportional to 
0, <dg e L^' ^2 J 


(my- , 

mgv+d^m^ z(af-f0*v*:) f (r,0g^a^+a*v^ 

From (8), (9), (10), and (17) as is distributed as proportional 
, to , 
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whfre 


and 

3 ■ * (a^ 711, ~ 

The origin for % is at the mean of population (1 ). 

Thus the distribution of -5- for samples of two drawn from 
a population represented by (1) has been completely determined 
as being proportional to 
(19) 

k,(3)+kj4)*ks(l8). 

Let be the respective areas under the curves repre- 
sented by the three terms of (19). Then k„ /^,and to 

be so determined that 

A, * -N 

where A/ is the total number of samples considered, and that 

L J ^ ijL 

Af Az As 

where 




G. A. BAKER 


7 


Expression (19) indicates, in general, that if the means of 
the components do not coincide and if one component is not large 
compared with the other, both tails of the distribution of the 
means of samples measured from the mean of the population 
divided by the standard deviations of the samples are heavier for 
populations of the type ( 1 ) than for normal populations. In case 
the means of the components coincide one tail will be heavier 
^ / cf^X case at least one tail will be heavier. 


' EXPERIMENTAL RESULTS 
Samples of four were drawn from a population approximate- 
ly represent'd by 
( 1 ) 




/ 




(X-/A.5) 


e 


64a 


Z5 


which is the same as Population I in the first reference. These 
samples were drawn by throwing dice. The means of these 
samples were calculated and referred to the mean of (1) as an 
origin. The standard deviations of the samples were obtained and 
^ calculated. A grouped frequency distribution of 1038 of these 
values is presented in Chart I and in Table I. Large values are 
obtained more frequently than would b^ expected from a normal 
population. 
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TABLE I . 

Grouped Erequency distribution of 1038 Values of 759/5* 


for Samples of Four from Population I 


1 Middle of Interval 

Frequency 

—6.0 

1 

-5.6 

' 0 

— S.2 

1 ’ 

-4.8 

2 

—4.4 

1 

-4.0 

1 

—3.6 

4 

—3.2 

6 

—2.8 

4 

—2.4 

10 

—2.0 

11 

—1.6 

9 

—1,2 

42 

—0.8 

71 

—0.4 

185 

0 

294 

0.4 

183 

0.8 

100 

1.2 

40 

1.6 

23 

2.0 

14 ' 

2.4 

9 

2.8 

9 

3.2 

8 

3.6 

4 

4.0 

0 

4.4 

1 

4.8 

1 

S.2 

1 

5.6 < 

0 

6.0 

1 

6,4 

2 

6.8 

' 0 

7.2 

0 

7.6 

0 

8.0 

0 

8.4 

1 







A STATISTICAL APPROACH TO MATHEMATICAL 
FORMULATION OF DEMAND-SUPPLY- 
PRICE RELATIONSHIPS 

By 

Robert W. Burgess 

A scientific approach to the practical problem ol forecast- 
ing the prices' of commodities clearly requires the develop- 
ment of methods of a somewhat mathematical type for 
analyzing the relationships between demand, supply, cost, and 
price. In the case of cotton and other annual crop agricultural 
commodities, the multiple correlation, link-relative and trend- 
ratio methods as applied by Moore, Schultz, B. B. Smith, 
Ezekiel, Holbrook and E. J. Working, and others, have dem- 
onstrated their worth. But for copper, lead, rubber, and similar 
commodities not on an annual crop basis, where quantity 
produced, or the quantity available, in a given period cannot 
logically be considered as the supply linked to the average 
price of that period, the method seems inapplicable and an- 
other type of approach is necessary. For this reason, and be- 
cause of the failure of price to function as expected as a major 
regulator in our present money economy, it seems worth 
while to attempt to develop a general mathematical procedure 
involving cost, demand and supply functipns and to analyze 
elasticity of supply and elasticity of demand as a mathemati- 
cian naturally does. But some mallhematical studies along 
these lines have not seemed to represent a truly scientific ap- 
proach to the problem, however helpful they may be in sug- 
gesting potentially valuable ideas. 

The mathematical economist of the non-statistical type 
, sometimes seems to believe that he has contributed to the 
splution of economic problems if he finds an answer in the 
form of a mathematical equation with undeterinined con- 
stants. The determination of these constants is left as a sec- 
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ondary step to the statistician. But in certain cases, at least, 
the determination of these constants would be at least as dif- 
ficult as the original problem. Moreover, the. assumptions 
made in setting up the equations, while not always stated 
explicitly, ha,ve not been shown to constitute a sufficiently 
close aproximation to actual non-exceptional conditions for 
the analysis to be useful. It may be suggested that, according 
to correct scientific procedure, careful statistical analysis of 
known and knowable conditions is necessary before worth 
while mathematical formulation can be attempted. 

To bring out my point of view by discussion of a particu- 
lar equation, Professor Evans,, in his “Introduction to Mathe- 
matical Economics,” uses as a cost function, that is to say, a 
function stating total cost for goods produced in a given unit 
of time, in terms of the .quantity produced, it, 

\ 

q(u) = Au^*Bu -hC 

This formula involves several unstated assumptions: 

(a) That the same continuous formula will apply over , 
the entire range of ii which may appropriately be consid- 
ered in discussing a particular problem. Actually, several 
points of discontinuity would be more normal. In fact the 
natural method of statement of this relation might be mere- 
ly a number of discrete points rather than a function .de- 
fined for all values of u. For instance, under the conditions 
of a particular problem, increase in production might be 
accomplished solely by increasing the number of machines 
engaged in that process. If so, any rate of output other 
than an integral multiple of the normal output of the ma- 
chine might represent inefficient operation and therefore 
be barred from further discussion. More generally, I should 
expect a very high unit cost for low values of ti, when pro- 
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duction is on a job basis, with abrupt downward steps as 
quantity production methods are applied, and finally a near- 
ly horizontal line. 

(b) That a term of the type Au^ is worth consider- 
ing in the typical case, Some analysis seems called for as 
to when this term is appropriate. In mining and agricul- 
ture, some such increase of cost with increasing quantity 
undoubtedly does occur, but it is difficult to imagine prac- 
tical cases of factory production in which unit cost in- 
' creases with quantity at any such rate if advance notice 
of contemplated increase, has been given. If production 
beyond present normal plant capacity is desired, such forced 
production- might involve some temporary increase in unit cost 
as production increased, but in these days such a case would 
be unusual. 

(c) That the variation of unit cost with quantity is 
important enough to justify singling that out as the single 
factor of variation, although actually my impression is 
that fqr rather broad ranges of quantity, other factors are 
more significant causes of variation in costs,' Such factors 
include regularity of production rate, the weather, labor 
conditions, character of supervision, and management pres- 
sure reflecting price conditions for the product. 

(d) That ^ is an adequate expression for the element 
of unit cost which decreases as u increases. Accounting 
discussions of total cost usually emphasize, in addition to 
-constant overhead, certain elements of cost which increase 

somewhat . with the quantity produced, but not in direct 
proportion. There is also the very sharp reduction in unit 
costs when production is first initiated. As a first hypothe- 
sis as a basis for detailed statistical study, I suggest that 
a term of the type Ce might be useful, either alone 
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or with the term , to cover the elements of unit costs 
which decrease as u increases. 

(e) That some definition of cost of a logical nature 
can be framed and applied to this case. 

The statistical aproach to a cost function might well 
show that the range of applicability of a continuous formula 
is rather narrowly limited and that a careful statement as 
to attendant conditions is at least as important for reliability 
of the results as precise determination of the constants. Un- 
fortunately, the results of statistical experiments along these 
lines are not available. In these days of stiff competition and 
rigorous government regulation,’ industrial concerns will be 
reluctant to permit publication of the kind of analysis of 
costs which is essential to what seems to be the correct sci- 
entific approach to this problem, It is hoped, however, that 
the preceding discussion has been specific enough to show 
the nature of the analysis which I think should precede the 
formulation of an expression of functional relationship, and 
to suggest the kind of discussion which a mathematical 
economist should include in his results. 

Returning to the general criticism that the treatment of 
supply-demand-price relationships by certain mathematical 
economists is not truly scientific, let us review in broad lines 
the history of scientific progress in those lines where it has 
admittedly been successful. The steps ' have been about as 
follows : 

1. . Creation of a serviceable mechanism for the meas- 
urement of data. In the case of astronomy, for instance, 
this required the invention of the telescope and general 
agreement on angular measurement. 

2. Careful making and recording of observations with 
this approved system of measurement, the observations 
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being in certain cases of events over which the scientist 
has no control and in other cases of experiments whose 
conditions could be modified at his convenience. 

3, Derivation of empirical laws from these data. 

4. Discovery of fundamental principles. 

It is true that these steps are not in fact separated as com- 
pletely as the outline might suggest, and that attempts at 
the discovery of fundamental principles often help in formu- 
latlng-the plan according to which observations are recorded 
and give the workers a motive for intensive effort. I think 
the statement ^will stand, however, that very few discoveries 
of fundamental principles have been made until substantial 
results have been secured under the (1), (2) and (3) headings. 

For example, Newton’s astronomical laws were discov- 
ered after Huyghens and others had made the telescope a 
useful instrument, Tycho Brahe had made an enormous num- 
ber of observations, and Kepler had deduced empirical laws 
(or those observations. 

Again, in actuarial science, the whole structure of mod- 
em life insurance became possible only after careful vital 
statistics had been recorded for many years and analyzed by 
the empirical laws of Gompertz and Makeham, 

On the other hand, when we turn to treatises on theo- 
retical economics or to books and articles on mathematical 
economics, there seems to be no ^race of the careful record- 
ihg of observations or their analysis by empirical laws as 
the basis for their theoretical discussions. I admit, of course, 
that mathematical formulas are stated which look like em- 
pirical laws, but no references are given to any studies justi- 
fying these particular formulations. If we imagine ourselves 
starting the scientific procedure described above as the basis 
for arriving at real economic laws, we note almost at once 
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that agreement as to the meaning of fundamental terms has 
not yet been secured. For instance, cost of production is one 
idea which is fundamental in analysis of demand-supply-pricft' 
relationships, but cost accountants and economists are by 
no means agreed among themselves as to what the term 
should cover. Under -the circumstances, it seems to me that 
the most profitable scientific approach at present would be 
to analyze various relationships which can be put on a quan- 
titative basis, with special attention to noting all the special 
circumstances of the cases analyzed. For example: 

(a) In many cases it would probably be possible to 
study the relationship between the price of a manufactured 
article and the price of the raw material or the r4w mater- 
ials used in making it. A simple case which I have actually 
done in my office and used in price estimating is determin- 
ing the price of cotton yarn in terms of that of raw cotton. 
We find it advantageous to compute the cotton yarn price 
according to the formula, compare that with the actual 
price and note the relation to general business conditions 
or to competitive conditions within the industry. The sta- 
tistical methods required for a problem of this type are 
obvious, but relatively little, I believe, 'has been done on 
this line. Broad comparisons of index numbers of prices 
of finished goods and raw materials I regard as another 
kettle of fish altogether, 

(b) Relationship of change in price to change in stocks, 
Any study of actual data of the commodities, such as cop- 
per, lead, and rubber, shows that price tends to decline 
when stocks increase and rise when stocks decrease. A 
first step in the quantitative approach to price forecasting 
is to obtain a more precise formulation of this correspond- 
ence. It may be noted that some rather vague mathemat- 
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ickl ideas come to the surface in discussions of these rela- 
tionships. For instance, if in a given month production' 
has decreased and consumption' has increased, it is some- 
times said that these are two arguments for higher prices, 
and prices are expected to rise. But it may happen in cer- 
tain cases that even with such a decrease in production 
and increase in consumption the month’s production still 
exceeds consumption and stocks arc increasing. On the 
whole, then, the monthly figures point to lower rather than 
higher prices, and it is a useful dijty of the mathematician to 
point this out. 

An audience of mathematicians probably regards the 
preceding illustration as trifling. I bring it up to illustrate 
the fact -that progress toward a more mathematical atti- 
tude in commodity forecasting must proceed step by step. 
A more advanced stage in the quantitative formulation 
is, of course, to determine the equation connecting change 
in price and change in stocks as reported monthly. This 
also has actually proved useful. 

(c) Exact definition of the phrase "cost of production" 
as actually effective when the problem is;. 

1. Establishing an appropriate price under regulated 
monopoly conditions. 

2. Determining which manufacturing or mining en- 
terprise will survive. 

3. Shutting down established sources of supply, 

4. Creating new sources of supply. 

As I see it, there are at least four costs of production, caci 
of . which is important under certain circumstances. 

1. Complete economic cost, including interest on er 
tire investment at an appropriate rate. 
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2. Economic cost, excluding interest on the capital 
value of ownership. 

3. Out-of-pocket expense, which excludes depreci- 
ation charges in excess of actual replacements in the 
period considered, interest on investment, and design or devel- 
opment expenses. 

, 4. Economic cost plus reward for the enterpriser 
over and above interest at a reasonable rate on his in- 
vestment. 

Roughly speaking, it may be that these are in order closely 
related to the costs required for the problems stated above. 

(d) Analysis of changes in cost at different price levels. 
One point which has been emphasized by the experience 
of the past year or two is the fact that cOsts are by no 
means kept constant when the price varies. Eor instance, 
the best information available two years ago suggested 
that at 18^ New York, certain producers of rubber would 
begin to drop out because of costs above that figure. As 
a matter of fact, they seem to have been able to change their 
costs. Such changes are possible by several means, for in- 
stance ; 

1. Wage differentials varying with market price. 

2. Bonuses for officials varying with profits. 

3. Exploitation of beat ores or plantations in times 
bf lower prices. 

4. Increased pressure for efficiency when essential. 

(e) Careful analysis of the relations of cost to quantity 
produced with the consideration of j 

1. The time ahead that the quantity is known to be 
required. 

2. Continuous production versus intermittent produc- 
tion, perhaps in lots whose size has been determined to 
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be most economical in view of manufacturing and distri- 
buting conditions. 

3. Mass production or production in smaller quanti- 
ties by trained mechanics with attention to the discontin- 
uity in costs per unit when the transition from one type 
of production to another occurs. 

In view of the considerations suggested in (1), ( 2 ), and (3), 
it does not seem to me that the assumption of an algebraic 
formula connecting cost with quantity produced represents 
an adequate realistic formulation of the problem. 

(f) Analysis of way elasticity of supply or demand act- 
ually works. Such an analysis seems to me essential before 
a definition of coefficient of elasticity is framed or made the 
basis fdr elaborate developments. Recent experience has, I 
think, shown : 

1. Elasticity of demand does not mean shrinkage of 
demand with high prices to anything like the extent ex- 
pected. 

2. Elasticity of supply functions much more slowly 
than expected at the low-price end of the spectrum. The 
frictional factors include sympathy with employees who 
would lose their jobs if economically unprofitable production 
ceased, reserve funds which permit companies to continue 
operations when even out-of-pocket expenses are not being 
fully recovered, bank willingness to lend on commodity stocks 
which are not, in fact, marketable, within a short time, and 
the cost of shutting down and reassembling a working force. 

Most of the matters discussed in (a) to (f) above cannot 
be analyzed fully on the basis of published records. More- 
over, in view of the fact that price information is part of the 
life blood of any particular business, it will prove difficult 
lor outsiders to secure much of the information which would 
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be needed to complete the analysis. A possible plan would be 
to place research specialists in industrial or marketing con- 
cerns to study actual data. It is not probable that the best re- 
.search of this type can be done in academic halls as a sideline 
to teaching. It is also unlikely that corporation officers with 
an ax to grind can themselves complete the scientific analysis of 
such material. It seems clear, however, that a satisfactory 
solution requires a coordination of both points of view. 

Summarizing the point of view I have tried to outline, I 
believe that the analysis of cost-price-supply-demand relation- 
ships should be relatively more inductive than it has been, 
especially in .the type of theoretical work classified as mathe- 
matical economics. On the other hand, I think that the deduc- 
tive approach is worth while/ that we should try to formulate 
general principles in this field, and that the ultimate ideal in- 
volves a mathematical form, — though, perhaps, when it really 
covers price situations as they are, a mathematical form some- 
what dififerent from those required in the physical, chemical 
and astronomical sciences. 



THE DISTRIBUTIONS OF THE PRECISION 
CONSTANT AND ITS SQUARE IN SAMPLES 
OF n FROM A NORMAL POPULATION 

By 

H. U. FttDMAN 
W?»hlngton University 

INTRODUCTION 

The following paper is a study of the properties o£ the dis- 
tributions of the precision constant and its square in samples of 
from a normal population. The properties studied are (1) modes 
and optimum values, (2) the first four moments, (3) skewness 
and flatness, and (4) medians and quartiles. 

The distribution curves shown in the figure are for 77*4, 
10, and 25, All the curves are drawn together and to the same 
stale, so that a graphical comparison of the two distributions can 
be easily made for both the same and diflFerent values of' 77, The 
numerical values for the various parameters given in the tables are 
for 77*4, 10, 25, and 100, except in the case of the medians and 
quartiles where the values for 77 » 100 are omitted, and in case of 
71 ■ 4, no moments higher than the second exist for the precision 
constant, and none higher than the first for the precision constant 
squared distribution. 

I, Dish’tbntions 

Let us denote the standard deviation, precision constant, and 
the precision constant squared of the parent population hy S, H, 
and U, respectively, and those of a sample from the given popula- 
tion by s, h,and u, respectively. The standard deviation, 3 , is then 

defined in terms of the variates, , and its mean 

K by the equation 
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Wc also have the following well known relations between 5, H , 
zxiiU, oTSfh, andu: 


~jih 


( 1 . 1 ) 


3 *=-^ » or u-h^ 

Za 


( 1 . 2 ) 


The distribution of s as given by R. A. Fisher* and other's is 

--4^ 

77-J 71-Z >Jes 

77'^5 ^ 

f(3)ds = = • • • • (' 1 . 3 ) 

^12^^77-1 r(^ 


where n is the number of items in the sample. 

The distribution of 3 and the transformations (1.1) and (1.2) 
enable us to find the distributions of the precision constant h, and 
its square u. Thus, using (1.1) and (1,3) we get 

■ „ ^ "^dh 

F(h)dh- .... (1.4) 


Z ^ 




and by means of (1.2) and (1.3) we find 

^ u du. 

7SfF(^) • ■ ■ ■ <‘'S) 


2. Modal and Optimum Values 
We shall now obtain some of the properties of the distribu- 
tions of our parameters. The simplest of the properties of any 


♦See, for example, R, A. Fisher, Applications of “Student’s” distribu- 
tion, Metron, VoL S, No. 3, (Dec. 1, I92S), pp. 90-104. 
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continuous distribution is its itiodsl vsluc or tbc nbscissn of the 
maximum ordinate of the curve. Denoting the modal values of 
the distributions (1.4) and (l.S) by/7 and u respectively, we 


find from the condition for an extremum 


dh 


d4> 

du 




H-H ( 2 . 0 ) 

( 2 . 1 ) 


In obtaining h and u we’ regard h and u as variables and H 
and U as constants. We may, however, reverse our point of view. 
That is, we regard H and U as variables and h and u as constants. 
In that case the right hand sides of (1,4) and (l.S) become func- 
tions of H and U, and from 




( 2 . 2 ) 



m 


The quantities A/ and f/R. A, Fisher calls the opthnum values, 

3. Afomsfl*j , 

Precision Constant 

In order to distinguish between the moments of the two dis- 
tributions treated in this paper, we shall denote the tth moment 



H. M, PSIDMAN 


23 


of the precision constant about the origin by /ll[. (n) and that of 
its square by ^u^with similar notation for the moments about 
the mean. 

Using the general definition of a moment of a continuous 
distribution, we obtain for the first moment of n, which is also its 
mean 





■/ 


.nMt 

dh 


( 3 . 10 ) 


To put this into an integrable form we make the transformation 
j. nH^ 


( 3 . 11 ) 


This yields for the first moment 

^ ( 3 . 12 ) 

To facilitate calculation we express this in terms of factorials. 
For this purpose we have two cases to consider, namely, the case 
when 77 is even, and that when n is odd. 

When n is even is an int^er, and, 




,^^iT7‘4)t7h6)‘ ■ J? 




*HJn 


r(^) 


Tn-dXrj’^)" 1 Jrr 

( 3 . 13 ) 


When n is odd is an integer and hence 

M, M Jg - Hjn ...2 /£ 
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^yjfr ■■■■ 

Similarly we obtain for the second, third, and forth momenta 
of the distribution of about the origin the following expressions : 



(3.15) 




. (3.16) 




nf ^ 


- (3.17) 


MomnU about the Mean 

To study such properties of a distribution curve as skewness 
and flatness we must have the moments of the curve abbut the 
mean. To obtain these we use the well known formulae for ex- 
pressing the moments about the mean in terms of the moments 
about any origin. Using these formulae we obtain for the first 
four moments of the precision constant, A?, about the mean the 
following expressions: 


W] H‘ (3.18) 



\Z(Tt'3)(r,-4.)/j:^~ nfZr,>0)Mi H * 


. (3.19) 
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/i_\ \jf^( n-4hZ 

7^ 3 )t^- 4Xn.J) 

(3.20) 

where ju^ (h)h given by (3.12). 

For future use we shall give here approximations for /ulfh), 
ja^(h), M 3 (^y> M 4 .Ch). The approximations were 

obtained by expanding the various quantities into power series of 
•jj. The derivation of these are not difficult but rather long and 


will therefore not be given in this paper. 

These approximations are as follows : 



( 3 - 22 ) 

(3.23) 

(where a is a conststnt) 

, • • • • (3.24) 


Precision Constant Squared 

The first moment of the precision constant squared distribu- 
tion is, using the general definition of a continuous distribution 
curve about the origin 




„n^ UU^ 

r 




. . (3.30) 


We reduce this to a known integral by means of the transforma- 
tion 



• • • • 


(3.31) 
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We then obtain for the first four moments of tx about the origin 
the following simple expressions: 

^ 3 . 32 ) 




Ms M- 




(n-^XrhSXT7-7) 


U‘^. . . . 


(77-3Xn-3Xn-7).M-9^ 




. , . . ( 3 . 33 ) 
. . . . ( 3 . 34 ) 
. . . . ( 3 . 35 ) 


Moments aJboui the Mem 

For the moments abofit the mean of the precision constant 
squared distribution we have: 


(r}-5)^<n-3) ^ \ 

r»-3)*{n-P)(n.7}(n-9) ^ 


( 3 . 36 ) 

( 3 . 37 ) 

( 3 . 38 ) 


Skewness and flatness ' 

From the above expr^ions for the mean and also from the 
numerical values given in the tables we may conclude that the pre- 
cision constant distribution is less skew than the distribution pf 
the precision constant squared, at least for values of 77 up to 100, 
But what happens when ?7 grows very large? To answer this we 
make use of the Pearsonian measure of skewness, ^6), defined by 


A 




. . . . ( 3 . 40 ) 
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Since we are now interested in large values of rt we make use 
of the approximate values of /u^(h), , which are 

Hence we get 

Hm /im ^ 


Ttitt 

and Um 

Ttitf 




‘J.O 


(3.41) 


To find /S^ fill wt make use of the exact values of Mt.M 

and /<j <'u;which are given by (3.3^ and (3,37). This gives 


Um fii(a}*Um 

Tji*< 


r l6r,^U^ 1 


r I 

\(n- 3) ^(n-SX n-7) 

t _ ^ 

(■n~3)^My 


... JAfrr-S) - 
® c/ 

nif 


(TT-rj^ 


(3,42) 


From (3.41) and (3.42) we learn that both the precision- constant 
distribution, and that of its square, approach perfect symmetry 
as the size of the sample, 77 ., approaches infinity. 

Hie fiatness or kurtosta of a curve is measured by the 
quantity 




(3.43) 


From this and the expressions (3^ and (3.34), (3.33) and 
(3.35) we get 


Urn Sjh) “ Hm 

nit0 ** 



(3.44) 


wd 

/• A . 4n*u 
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(■n-5)(Zv-i-27) , 

thm — — — ‘ • 

(n-7)(r,.9) 


. . (3.45) 


may conflude, then, that while the distributions of the pre- 
'fWision constant and its square are both perfectly symmetrical for 
f |very large values of 77 , they are nevertheless entirely distinct clis- 
( tribution curves for both small and large values of 77 , since Um 

TtmU 

and As a matter of fact the dis- 

tribution of the precision constant approaches the normal curve, 
while the precision constant squared distribution approaches a 
curve of the form 

i 

where y; and a are constants. 

I 

4. QmrtUes and Medians 

The quartiles of a continuous distribution fM may be 
defined by the equation 

^ (U 1,2,3) (4.00) 

For t® 1, Qi is called the lower quartile; for i * 2, Qi is called 
the median and for i * 3, (?,• is called the upper quartile, 

In order to find the quartiles of the distributions studied in 
this paper we must make use of the incomplete T- function. This 
function is defined as follows 

. ^uJUp 

i • • • • (4.10) 


Pearson’s "Tables of the Inconplete t -Function” give the values 
otlfU^ andp. Thus, if we know any two of the variables we can 
easily find the third. 



H. M. FELDMAN 


29 


Let us take, now, the distribution of h, 


F(h)dh' 


■ z^r(^) 

The various quartilts of this distribution will be given by 


By making the transformation' 
(4.11) is reduced to 



(4.13) 


This is, of course, since i and u are known 

we can easily find from Pearson’s Tables. 

Comparing (4.11) and (4.13) and taking into account the 
transformation (4.12) we find that * 

rfi 

(Zn-2)i 

Since the lower limit in (4.13) is oo instead of O ,we see that the 
lower and upper quartiles are reversed. 

To find the quartiles for the distribution of a we rimply make 
use of . the relation Um h\ 

In conclusion we may state that the. results of this paper are 
similar to those of Professor Rietz on the distributions of the 
standard deviation and the variancci* 

*Rietz, H. L. A comparison of the distributions curves of variance and 
of standard deviation, Mathmaticci Monthly, Vol. 36 (Augrust-Sept., 1929), 
p. 3SS. 
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TABLE I 

Modal and optimum values for the distribution of the pre- 
cision constant and its square from a normal population for 




Pr. const, h 

Pr. const, squared a 

n 

Mode 


Mode 

Optimum 1 

4 

■■ 

HESQI 

0.800 t/ 


10 



0.909 U 


25 

HI 

1.021 H 

0,962 V " 

1,042 U 

100 

HI 

l.OOS w 

0,990 t/ 

1.010 U 


TABLE ri 

Values of the mean and the first four moments about the mean 
of the distributions of the precision constant and its square for 
sa mples of 4, 10, 25, and 100 from a normal population. 

Precision constant 


V 

Mean ju't 

Mi 

Mi 

M*. 

4 


1,454 



10 


0.0982 


0.0815 H* 

25 

1.054 ^ 

0.0255 


0.0026 N* 

100 

1.013 A/ 



0.0000907 Ay 


'recision constant square U 


L7L_ 

A 

^ 

M, 

Ma. 


4.0001/ 

1.4291/ 


3.110 4/’ 

52.200 !/■* 


1.136 U 

0.129 

0.065221/^ 

0.892 [)* 

i 100 

1.031 U 

mmZwSM 

0.0199 

0.001291/^ 


TABLE III 

Values ot/$, and the distribution of the precis^ion 

"constant and its square for samples of 10, 25, 100, and oo from 


a normal population. 





u. 1 

7» 

A 

/9a 

/5, 


10 

4.8548 

8.4521 


78.3416 

25 




5.3172 

100 

0.00134 

3.2289 


2.5476 

msSLss 


3,0000 

0.0000 

2.0000 
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TABLE IV 

Medians and q^uartiles for the distributions of the precision 
constant and its square for samples of 4, 10, and 25 from a normal 
population. 


Precision constant h Precision constant squared u 


Median I Q. \ Qi I Median 


81 

0.937/7 1.094 W 1.23 

0.941W 1.036/V 1.146 0.885^/1 1.073 </ I 1.313 1/ 


Distribution curves of the precision constant and its square 
for samples of 4, 10, and 25, from a normal population. 

Note that the mode for the precision constant distribution is 
the same for all values of nsts is seen from (2.0). 

The solid curves are for the precision constant,, the dotted 
curves for its square. 

The unit used is the standard deviation of the population 
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A POSTULATE FOR OBSERVATIONS 

By R. Hi^NcissoN 

When measurements are made by a given observer using a 
particular instrument, if the mean or expected result of the mea- 
surements is not supposed to be equal to the true value of the 
quantity measured the difference is considered In' be an error, 
personal or instrumental or both. A correction is therefore ap- 
plied to any such measurement in order to remove the discrep- 
ancy. In other words a given combination of observer and instru- 
ment is not considered to give correct or balanced measurements 
until this discrepancy is removed. Also as between two instru- 
ments or observers, both giving balanced measurements after the 
application of known corrections, preference is given to the one 
which shows the smaller variations between different measurements 
of the same quantity. 

In selecting the formula to be used to determine, from the 
results of a series of measurements involving certain unknown 
quantities, the best measures of those quantities we are in a posi- 
tion similar to that of an observer desiring to make a certain 

I 

measurement and selecting the best available instrument for the 
purpose. Such an observer would, in the first place, require that 
the instrument should give balanced measurements and would, in 
the second place, among a number of such instruments select the 
one showing the smallest standard deviation. This suggests the 
following definition and postulate. 

Definition A balanced measure of a quantity is one of which 
the mean or expected value is equal to the true value of, the quan- 
tity measured. 

Postulate Of two or more balanced measures of a quantity 
the best measure is the one which has the smallest standard 
deviaition. 
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Repeated Measurements 
When we have a number of different results 

^1 > 

of balanced measurements of the same quantity a , then any ' 
function of the form 

■f) Qi * ^zCig i- ‘ ■ -h Ct, 


will also be a balanced measure of the quantity. If the standard 
deviations of the individual measures are respectively 

the square of the standard deviation of the function will be 


7T^r7~rrrrj;^ 

This is a minimum when 

the best measure is the average ot the individual measures each 
weighted inversely as the square of its standard deviation. If 
the individual measures have the same standard deviation this 
reduces to the ordinary arithmetical average. 

Combinations of Observations, 


In applying this postulate to the theory of combination of 
observations, suppose that there are rt unknown quantities 


Xi(C«l,2J ■ ■ • -n and that we have balanced measures of 
infm >n) linear functions of these unknowns of the form 
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For simplicity we shall assume that the functions have been so 
taken that the standard deviations of these measures are all equal. 
Then a linear function of these measures of the form. 


h’m L^n 


( 2 ) 


will be a balanced measure Qi if 

/>«>T7 


( 3 ) 


where if and Sj" •‘0 if 


htm 


]s a 


It will be the best measure of that type if £ % ’ 

' minimum subject to those conditions. > 

By the method of indeterminate coefficients we find that this 
occurs when we can write 


^Jk^hk’ ( 4 ) 

and the values of the 77 * coefficients 
ijk are determined from the 77 ^ conditions (3). Then 

k*n htm 

5/ i^k (S) 

) 

hnfft 

it we write for Z ^ . ,The value of each of the 

n*f 

77 unknowns xj is thus expressed in terms of the rr func- 
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tions yj^ or, in other words, may be determined from the n 
.^nations expressing in terms of the m unknowns xj 
These equations take the form, if we write 

A jk^j^y^ ( 6 ) 

/«/ ^ 

It will be noted that although no assumption regarding^ the law 
of error, other than that of balance, has been made the equa- 
tions deduced are the same as those derived, in the ordinary 
theory of least squares, from the assumption of the normal ex- 
ponential law. 

Measurement of Probabilities. 

Where the quantity measured is a probability and the measure 
is to be determined from the observed result of a finite number of 
trials we know that, if the probability is p the number of 
trials n and the number of occurrences of the particular re- 
sult r then the expected value of r is n/sf Consequently r/rj 
is a balanced measure of p . The measure uSuaily associated 
with Bayes’ theorem, namely is not a balanced measure. 

Its mean value is y which is not equal to p unless 

p happens to be equal to 

For this case a diflferent postulate might consistently with 
the general methods of science, have been proposed as follows. 
Thxt hypothesis is to be adopted which makes the compound prob- 
ability of the hypothesis and the observed facts a maximum. If 
then we considered one value of the probability as likely as an- 
other this would mean selecting the value of p which would 
make p ’'( a maximum. This would have given 
r/77as before. 
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Prequencyi DistTihutioiu 

The notation on the subjefct of moments is so unsettled that 
it appears to be necessary for each writer to specify the notation 
adopted. In this paper the /rfth moment about the origin in a 
finite sample will be written and the corresponding moment 
about the mean value will be designated by . The moments 
in the population from which the sample, is drawn will be written 
977 /. and respectively, 

In this connection an important consideration arises from the 
fact that balanced measures are not always consistent under ordi- 
nary mathematical transformations. This happens because if y 
is a balanced measure of x then f(y) is not necessarily a bal- 
anced measure of Let y»x-hh and let mean 

values be indicated by prefixing 7^^ so that 

fn, (h)» fh, (y)-x. - O. 

Then since 

hf'Mf ^ f "(x) i-ttc 

we have » /A . 

RRf, ^f(y)) * ^ ^ ^ 

Ordinarily therefore unless f^xj js a linear function of x 
dr, if not, y is an exact measure of fn^ ^ fCy) } will 
not be equal to ffiO . 

A simple illustration of this fact arises in connection with the de- 
termination, from a sample, of a measure for . By ordinary 
transformations we have the well known formula >2^ 

Also 777^ and njj are balanced measure of 77}^ and 777, re- 
spectively but 777* is not a balanced measure of . We 

have in f adt. yf 7 ,( mf) m Ju ^ , Therefore, 

(m,, - mf) » - wt* -X . (t-l . 

The balanced measure of would therefore .be ^ 
which is not formally consistent with the balanced measures of 
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iv, and in the light of the equation JUg, * ihg- nif. By a, 
similar line of reasoning as shown by Thiele, we obtain 
(n- ifpri.g) '^■3 balanced measure of pj and by 
Tschuprow’s modification of Thiele’s analysis 


n 


i^(r}‘~Zn+3) jLi^-3(X.u-3)ju.l ] 


(n-l)(n~Z)(r7-3) 
as a balanced n^easute of . Here however we are faced 
with the further difficulty that while this is a .balanced measure^ 
its standard deviation for small values of r? is so great that 
possible values of and would result in negative values 
of whereas in any real frequency distribution not only, must 
pght positive but ( pi -p\ -p\ ) which is the mean 

— 2 must also be positive, 


value of 


6 


If, therefore, we wish to derive a value of p^ certainly satis- 
fying this condition we must determine the average value of 




for all combinations of three 


values from the sample of n and use it as a balanced measure of 
(pjtp 4 .’/Ug -pg ) This average value is found to be 

The analysis is as follows. 

6 } 

“ ^a, ^ ~ “.*5/ ~ ^1 } 

^ -L..U 5 

** mg + Z rftg trig m, ~ frt^ trt^ - ~ nig 
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If repetitions were allowed in the finite sample the avferage 
value would be the corresponding expression in moments of the 
sample, namely, Bwt since the ex- 

pression vanishes if two or more of the values of x involved are 
equal the exclusion of repetitions reduces the total number of 
^rmutations three at a time from 7)^ to ti(v-i)( v-Z) 
without reducing, the sum of the values. The average is there- 


increased to 


77 




The second moment mi|ht have been similarly derived , 
as the mean value' of and the third moment /Zg as 

the mean value of 






)(Z. 




•Xf )(Z 7i!g ■ ) 


6 

In this latter case the expressions averaged do not vanish when 
two only of the value of % are equal but they cancel one an- 
other in pairs so that their sum vanishes. 

We have thus as working approximations 




JL 

n-i 


_ 77 ^ 

^3^ (n-l)(n~Z) ^3 

fis-Az “ ) 

The net result of this investigation of the application of bal- 
atfced measures as presumptive values of moments in frequency 
distributions seems to' be that, in view of the formal incon- 
sistencies involved, it is necessary to carefully select the func- 
tions to which such measures are applied. The functions con- 
sidered above are suggested as well adopted for this purpose and 
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as probably sutiicient for all practical purposes. If they are 
adopted as fundamental the resulting approximations for the 
Pearson constants a”*! /‘’t 

are 




z ‘ 





and 



n(7i-Z) 


,(ILJ)L 

nfrj-Z) 




Mi 


It will be noted that the coefficient in the latter equation is 
very nearly unity for even moderate values of 77 , 



A SHORT METHOD FOR SOLVING FOR A CO- 
EFFICIENT OF MULTIPLE CORRELATION 


By 

Pawi, Horst 

The method which we present presupposes ft familiarity with 
tie Doofittle method ^ for solving normal equations. We start 
with the determinant 



/ 

; 

nt — 

^in 

( 1 ) 



1 — 

~ Otn 



In 

^77 

1 


where the elements are zero order coefficients of correlation, 
'Now the adjoint determinant of (1) may be written 





1 

1 

1 


(2) 

r- 

^IZ 



^zn 



^in 




where the elements are the cofactors of the elements in (1), 

From the elementary theory of determinants. * we know that 

(3) r. 


The adjoint determinant of r may be designated by M'R 
where 


(4) « r 


1 Mills, F. C., Statistical Methods, p. S77. 

* Boeher, Maxime, Introduction to Higher. Algebra, p. 33 . 
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Prom (3) and (4) we have 




Cn-lf 


or 

(S) 


/<- IP 


t 

n(v-Z) 


Hence the adjoint or r is obtained by multiplying each ele- 
ment of by ^ And if we write 




A„ 

>»/n 

(6) W- 


4^ 




1 ' 

1 

1 

' 5 

1 ^ 

4nn 

then 

,^ij 



(7) 


* 


so that (1) may be rewritten 





Alt 


(8) P “ 



-din_ 



.dlZL- 

Ann 


■jp'VZ 


The numerators of the elements in (8) are the cofactors of 
the elements in (2). 

L«et us now consider (8) as the coettidents of a set of normal 
equations whose constant terms are zero, and let us follow through 
literally the Doolittle elimination process. 

For simplicity we outline the reduction of a 4-variable problem 
as follows. 
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The -equations are the original equations with the co- 
efficients to the left of the diagonal omitted. The ^-equations are 
the product equations which are subtracted from the oC -equations. 
The ^-equations are the reduced equations which may be. repre- 
sented symbolically by 


(9) 


The fi-equations are the inequations divided by the negatives 
of their respective leading coefficients. That (9) is true may be 
readily proved from the theorem* 

= AAi/kt 

Where the notation indicates cofactors rather than minors. The 
proof is made more obvious if (9) is written 

/■- [ [{<4 -/», ) -/Sg) 

for each successive subtraction reduces the determipant by one. 

If we indicate the leading coefficients of the ^equations by 
we may prove that 

(11) fra 


( 10 ) 


Aij 

AU 




We have in the case of four variable;3 

{\2)', " J?‘A„zz 

Asr 

or from * 

vlTii 


rAuzzaa^ 



In the general case we have 

„r7-l 

'I’ti J^frr-s}n 




'Bocher, Maxime, Introdutction to Higher Algebra, p. 33. 
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Now let us consider Kelley’s equation for the coefficient of 
multiple correlation* with a slight change in notation to be con- 
sistent with the above, 

i(«) 

( * 

Obviously 

( 14 ) Fr,r, * 7 * 4 / 

So that (13) becomes simply 


(J5) 


But from (9) we have 


( 16 ) 



hence ^w-4?— *r An ^ 

But therefore we get 



1 ^” 


In other words is simply the square root of the last pro- 

duct summation. 

From (17) it is obvious that the solution for the coefficient 
of multiple correlation is considerably shorter than the standard 
Doolittle solution for regression coefficients. All of the back solu- 
tion work is eliminated, as is also the calculation of the last re- 
ciprocal. 

The only caution needed with respect to the order of the 
variables is that the dependent variable shall be the >^h variable. 

The usual summation check method may be employed exactly 
as in the solution for regression coefficients. 


* Keller, T. L, Statistical Methwl, p. 301. ea. 27S. 



A SHORT METHOD AND TABLES FOR THE 
CALCULATION OF THE AVERAGE ANto 
STANDARD DEVIATION OF LOG- 
ARITHMIC DISTRIBUTIONS* 

By 

Thomas N. Jsnkins 
N ew York Univeriity 

In fitting various types of curves to reaction-time data/ the 
writer was impressed with the enormous amount of labor and 
boredom involved in the calculation of the constants of logarithmic 
distributions. Besides the constant use of a set of logarithm tables, 
it requires the tedium of squaring large numbers on a machine to, 
compute the second moments of the distributions. In order to 
eliminate some of the labor involved in such a process, a short 
method was devised for the computation of the average and the 
standard deviation of logarithmic distributions. 

The short method described in this papei was originally 
developed to facilitate the work of fitting logarithmic normal curves 
to a large number of reaction^time distributions, but dispersions 
approximating this type seem to be sufficiently common in econ- 
omics and biology to warrant a more gen'ei'al fise of short methods 
in the computation of the constants of such distributions. In the 
field of economics, logarithmic curves have been fitted with suc- 
cess to distributions, of income and prices, and probably could.be 
applied equally well to distributions of capital. Many skewed 
distributions can also be found in the fields oi biology and psy- 
chology. Kapteyn fitted n logarithmic curve to a distribution lof 
the minimum weights necessary to produce a sensation of pres 

•A portion of the work invcdved in thk paper was carried out duriof 
the writer’s tenure as a National Research Rdlow, 

^ Cf. Facilitation and Inhibition. Areh. Ptychol, No.'i|36, 5d p. 
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sute.* Kapteyn attempts to show that logically the normal curVe 
is the exception and skew curves the rule. For example, if “the 
diameters of certain ripe berries'* are distributed in a normal curve, 
their volumes will be distributed in an asymmetrical curve; in 
other words, volume increase will be dependent upon size, so that 
volume changes are greater for large berries than for small ones. 
That skew curves are the rule can be shown analytically. Suppose 
certain quantities a are distributed normally, and any other quanti- 
ties « are expressed as functions ofe, thus, 

Then, 

(1) d%~f'(x)dx 


If the frequency curve for the k's is. 





then the frequency curve for the.x’8 is, 


It will be seen at once that the x’s cannot be distributed normally 
provided^ is a non-linear function of «. If we let z«logA:, then 
^ and equation (2) becomes 


( 4 ) 


N J 
y » — 

a/Zrr X 


. fflogje- M) ^ 


This is the lo|arithmic curve of distribution, the theory of which 
hs^s been treated by several writers, one of the first and most im- 
portant'papers on this subject being that of McAllister.* The study 

•}. C Kapteyn, “Skew frequency curves in BWogy and Statistics". 
Groninsen. p. 4243, 1903. 

•The Law of th« GecHnetrk Mean, Proe. Roy. Soe. 29:367. (1879), 
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of the properties of the logarithmic curve of error was undertaken 
by McAllister at the suggestion of Galton,* who saw the possibility 
of applying^ it to psychological and social phenomena. 

Dispersions approximating this type are illustrated by dis- 
tributions which' are definitely limited at the zero point, but a more 
definite presumption in favor of the logarithmic curve is indicated 
when the real origin, determined a priori or deduced from empirical 
considerations, does not correspond with '•he origin on the value 
scale. 

Reaction-time distributions are good examples of dispersions 
where a displacement of the origin is indicated by empirical con- 
siderations. A little reflection will show that there must be a 
physiological limit for the speed of reaction. It takes a certain 
minimum time for the neuro-muscular machine to do its work. The 
time it takes for the machine to do its work constitutes an un- 
disturbed region within which no deviations ever occur. Reaction- 
time dispersions approximate the logarithmic more closely than the 
normal curve of error. Investigations in the field of learning often 
give distributions which have origins other than the zero of the 
scale which can be determined a priori . . . .that is. the real origin 
follows inevitably from the conditions of the experiment. If the 
norm of mastery for learning a maze is two perfect trials out of 
three, then the criterion is such that an animal to learn a maze must 
make at least two perfect runs. In other words, the experiment- 
er’s criterion is such that no deviations could possibly occur under 
two trials. 

In using the short method for finding the first and second 
moments nf a logarithmic distribution, the computer must still re- 
sort to a table, but in this case it is only necessary to use a single 
page table :kistead of an extensive logarithm table. Furthermore, 
the labor of squaring the logarithms is eliminated. The short 
method can best be explained by following the process through an 

Geometric Mean in Vital and Social Statistics. Proc. Roy, Sot. 
29 : 365 . ( 1879 ). 
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actual example." This is illustrated in Table 11 on a distribution of 
i-eaction-times. Beginning at 70 (the real origin of the distribu- 
tion is assumed to be at 70) the step-intervals are numbered from 
zero to the end of the distribution. Under the log x column of 
Table I the value for each step is found and multiplied by the 
frequency for each step. This operation gives the values shovirn 
in the Flog* column of Table II. The sum of these value.s 
divided by /y (number of cases) gives the correction C . The 
average (log 6 , ) for the logarithmic distribution is finally found 
by adding a factor K to the correction C. The constant K depends 
upon the length of the step-interval. In this distribution, the 
length of the step-interval Is, ten. Looking under column K of 
Table I, wc find that the value of K for a step-interval of ten 
units is equal to .69897. The geometric mean (G) ot the dis- 
tribution is found by adding 70 to ( G/) 

The process of finding the second moment and standard 
deviation C<y^) is similar to that for finding the first moment 
and the average. In one respect it is simpler : no correction has 
to be added for the length of step. The Flog**! column is ob- 
tained by multiplying the value for each step in the log*x column 
of Table I by its appropriate frequency. The sum of these divided 
by N (number of cases) pves the crude unit moment. The sqiiare 
of the correction C is then subtracted to give the corrected unit 
moment around the average. The square root of the corrected 
unit moment around the average gives the standard deViation(&^), 
and the antilog of gives the standard deviation ratio 

The formula for finding the average of a logarithmic distribu- 
tion is, _ _ 

He, ■^*K-C*K 

• l?or those interested, the proof of the formulae for getting the average 
and standard deviation is given in an appendix at the end of this paper. 

• One would expect the geometric mean to be different if the ongin were 
taken at a point other than 70. An origin at 70 was assumed because it 
results in an extremely good fit to the distribution. In this case, 70 would 
correspond to the physiological limit below which no deviations ever occur. 
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TABLE II 


Time 

F 

Step 

F log K 

F 

70 

I 

1 

} 


80 

3 

2 

1.431363 

.682934 

90 

14 

3 

9.78SS80 

6.839826 

100 

40 

4 

33.803921 

28.567627 

10 

ss 

S 

S2.483338 

50.081832 

20 

60 

6 

62.483561 

65.069923 

30 

52 

7 

57.925054 

64.525229 

40 

46 

8 

54.100197 

63.626769 

SO 

33 

9 

40.604814 

49.962150 

60 

28 

10 

35.805100 

45.785901 

70 

21 

11 

27.766605 

36.713541 

80 

14 

12 

19.064189 

25.960237 

90 

9 

13 

12'.581460 

17.588126 

200 

7 

14 

10.019546 

14.341615 

10 

7 

15 

. 10.236785 

14,970255 

20 

4 

16 

5.965446 

8.896638 

30 

3 

17 

4.SS5541 

6.917653 

40 

1 

18 

1.544068 

2.384146 

SO 





60 

1 

20 

1,591064 

2.531486 

70 





\80 

1 

22 

1.633468 

2.668219 


400 


400)443.381100 

400)508.114107 





1.270285 




C « 1.108452 

C * 1.228667 




.698970 






.041618 


' 

log 

G,. 1.807422 






.204004 


For origin at 70 

■ • • • 

G,- 65:8 
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TABLE I 


Step 

logx 

(Log xf 

Step 

Interval 

K 

1 

.00000 00000 

.00000 00000 

1 

.30102 99956 

2 

.47712 12547 

.22764 46917 

2 

.00000 00000 

3 

.69897 00043 

.48855 90669 

3 

.17609 12590 

4 

.84509 80400 

.71419 06972 

4 

.30102 99956 

5 

.95424 25094 

.91057 87668 

5 

.39794 00086 

6 

1.04139 26851 

1.08449 87247 

6 

.47712 12547 

7 

1.11394 33523 

1.24086 97921 

7 

.54406 80443 

8 

1.17609 12590 

1.38319 06496 

8 

.60205 99913 

9 

1.23044 89213 

1.51400 45481 

9 

.64321 25137 

10 

1.27875 36009 

1.63521 07719 

10 

.69897 00043 

11 

l.?2221 92947 

1.74826 38633 

11 

.74036 26894 

12 

1.36172 78360 

1.85430 26993 

12 

.77815 12503 

13 

1.39794 00086 

1.95423 62678 

13 

.81291 33566 

14 

1.43136 37641 

2.04880 22253 

14 

.84509 80400 

IS 

1.46239 79978 

2.13860 79042 

IS 

.87506 12633 

16 

1,49136 16938 

2.22415 97018 

16 

.90308 99869 

17 

1.51851 39398 

2.30588 45856 

17 

.92941 89257 

18 

1.54406 80443 

2.38414 61255 

18 

.95424 25094 

19 

1.56820 17240 

2.45925 66473 

19 

.97772 36052 

20 

1.59106 46070 

2,53148 65837 

20 

1.00000 00000 

21 

1.61278 38567 

2.60107 17684 

21 

1.02118 92990 

22 

1.63346 84555 

2.66821 91953 

22 

1.04139 26851 

23 

1.65321 25137 

2.73311 16157 

23 

1.06069 78403 

24 

1.67209 78579 

2.79591 12465 

24 

1.07918 12460 

25 

1.69019 60800 

2.85676 27889 

25 

1.09691 00130 

26 

1.70757 01760 

2.91579 59062 

26 

1.11394 33523 

27 

1.72427 58696 

2.97312 72744 

27 

1.13033 37684 

28 

1.74036 26894 

3.02886 22909 

28 

1.14612 80356 

29 

1.75587 48556 

3.08309 65087 

29 

1.16136 80022 

30 

1.77085 20116 

3.13591 68471 

30 

1.17609 12590 

31 

1.78532 98350 

3.18740 26197 

31 

1.19033 16981 

32 

1.79934 05494 

3.23762 64129 

32 

1.20411 99826 

33 

1.81291 33566 

3.28665 48386 

33 

1.21748 39442 

34 

1.82607 48027 

3.33454 91850 

34 

.123044 89213 

35 

1.83884 90907 

3.38136 59785 

35 

1.24303 80486 

36 

1.85125 83487 

3.42715 74737 

36 

1.25527 25051 
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37 

1.86332 28601 

3.47197 20810 

37 

1.26717 17284 

38 

1.87506 12633 

3.S1S85 47414 

38 

1.27875 36009 

39 

1.88649 07251 

3.55884 72561 

39 

1,29003 46113 

40 

1.89762 70912 

3.60098 85775 

40 

1.30102 99956 

41 

1.90848 50188 

3.64231 50672 

41 

1.31175 38610 

42 

1.91907 80923 

3.68286 07246 

42 

1.32221 92947 

43 

1.92941 89257 

3.72265 73909 

43 

1.33243 84S99 

44 

1.93951 92526 

3.76173 49312 

44 

1,34242 26808 

45 

1.94939 00066 

3.80012 13980 

45 

1,35218 25181 

46 

1.95904 13923 

3.83784 31768 

46 

1.36172 78360 

47 

1.96848 29485 

3.87492 51187 

47 

1.37106 78622 

48 

1.97772 36052 

3.91139 06589 

48 

1.38021 12417 

49 

1.98677 17342 

3.94726 19240 

49 

1.38916 60843 

SO 

1.99563 51945 

3.98255 98299 

SO 

1.39794 00086 


and, G/* antilog(C ■^■K) 

where G, is the geometric mean measured from an origin which 
may be other than the zero of the scale. The geometric mean {6 ) 
measured from the zero of the value scale is, 

G + (displacement of the origin) 

The formula for finding the standard deviation around the average 

and, cr^ > antilog 


Sununary of steps in the caloilation of the average (logG,) 
by the short method: 

1. Banning at the origin, find the deviation of the nud* 

point of eactLstep<intervai from the origin~in4mits.olste{:| 
interval. . MJCmOl'i j 

2. Using Table I, finUlthe lojg X^of each stepid^vition anp 
weight it by its appropi^ate Fr ( frequency), n 
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3, Find the jmw of the Flog*x's, and divide this sum by N 

(number of cases). This gives the correc^fw C. 

4. Using Table I, find the value of « corresponding to the 
number of units in the step-interval. Add the itetorK 

' to the correction C to get the average (log G, ). 

Summary of steps in the calculation of the standard deviation 
(0^) around the averagef log G , ) by the short method : 

1. Using Table 1, find the log'je of each step-deviation 
and weight it by its appropriate frequency. 

2. Find the of the F logVs, and divide this sum by 

3. Then subtract the square of the correction C to get the 
steoni mit momnt around the average. 

4. Extract the square root of the second unit moment to ob- 
tain the standard deviation (c^). 


appendix 

Deduction of the Formuiae eoe the Short Method 
Let log G be the logarithmic mean, x, the -length of step, 

f, the frequencies for successive steps, and tt?,, Wg 

r 77 f^ the mid-points of the steps for origin at aero, Then 

die first nud-point, w„a at x/Z the second, tt^ is at 3xJZtXc^ 
For convenience, these items may be arranged in the form of 'a 


table. 

Midpoii 

(M) 

m, 



F )ogM 

F h^M 

f, 


f, %■ 

e 

fl /oq 

• 

ft log 

« ' 

s i 

^17 

• 









lOGARTHMIC DISTRIBUTIONS 


where G is the geometric mean, and is the standard deviation 
around logi?. 

We have, therefore, 

• • - • • • • + fr,lo9i^)}C 
N 

Stating the logarithm of each fraction as the sum or difference 
of the logarithms of its factors, we have, 


ioqG 


. ^ (logl-log2+log \log(Z7hl)-logZ y- log 

N 


= - /o^ £ 



!S£^pdi+logK-fog2 

Since 

, _ Lf loQX 

fogpc 

and, 

, ^ , Lf log Z 
jogZ’- — — 

Letting 

K* h^x - log Z 

and. 


we finally have. 



^^G*C*K 



^4 T. N. IBNKWS 

Where C is the correction, and K is the constant indicated in 
Table I. 


For the second unit moment around log(?, we have, 



Expanding again and collecting terms, we have, 


z:[f„ hgrzn-v] 



In Table I, (2 t?- 1) so that, 

log je • log (2 77- 1) 
and ( fog x)* « '[ /of (2 jf. /;] * 


For each step, 1, 2, 3 • . , .77, the corresponding values o£'3r'are 
1, 3, 5 .... 2 77' 1. Note that x as used in the table is not the 
same as x as used in the deduction of the formulae. 

The figures in TaUe I are accurate to ten places of decimals. 
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The log ifc column consists simply of the logarithms of odd numbers 
from one to one hundred. K was computed by subtracting log 2 
from the logarithm of each number indicated in the “step interval'' 
column. The (log ai^*column was computed by squaring fifteen 
place logarithms with the aid of calculating machines. This had to 
be done by indirect methods through the use of the simple algebraic 
relationship, (dfrJ^.a^+Zar-t- where & is the first part 
of the number and is the remainder. The table was computed by 
two different persons and checked on two different calculating ma- 
chines by each person. 



ON SYMMETRIC FUNCTIONS OF MORE THAN 
ONE VARIABLE AND OF FREQUENCY 
FUNCTIONS 

By 

A. U O’Toow 

^ National Research Fellow 


In a paper published in this journal^ the writer has developed 
a simple ditferential operator method for expressing any sym- 

( metric function of the lij variates x, .^j^as a rational, 

int^ral, algebraic function of the power sums s, , 3 ^, • ■ ; ^where 

M/is the weight of the symmetric function and 

k k, . ^ ^ 

S^^Zki - x, / . 

The transformation to moments is then simply a matter of 

recognizing that va'i^ * if is the k th moment of the r7vari* 

ates with respect to the origin' from which they are measured. If 

the origin is at the arithmetic mean of the 77 variates the prime may 

b^ dropped and then . 

In the above mentioned paper the variates are of the serial 
distribution type, but, of course, not necessarily integers. The 
extensions to the case of more than one set of variates and to 
frequdicy functions now suggest themselves. It is the purpose of 
this note to discuss these proUems shnultaneously. 

Suppose that two sets, of (^variates each, 

given and that ( i - 1 , 2 , .... ,77) are cor- 
rMponding pairs. Modifying the partition notation used in the 
previous paper the symmetric function to be considered may be 
written in the form fa, .)J.e. the 


sum of all such terms as 




X, X. 

* A 






< • ■ 


and Symmetric, Functiona of Symfartric Func- 
tfaos, VoL 11 . No. 2 (May, 1931 ), pp, 102 - 149 . 
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where 

are jpositive integers and where 


eZ/ > <a^ » > • • >><?. 


77^ z V # ■ • • m, ^ ... . 

e,g. (^‘^ZAn) , Uj 4k . Obvious- 

ly the order of the parts in the bipartite notation will be important, 
corresponding parts being equidistant from the left of their respec- 
tive sections of the partition. The double partition will be said to 
be of weight i+i* 0 , 77 , 'r, ^ .... in at and of weight 
\Kis b,m, -f -7 b^rpj - my. There will be no los.s of 
generality if it is assumed that iV/ $ 

By a procedure similar to that employed in the first chapter 
of the paper already referred to, it may be shown that any sym- 
metric function of the type defined above can be expressed as 
rational, integral, algebraic function of the symmetric functions 

> '^ig ' ' wj 

where 

^hk n • 

Moreover, the terms of this function will be isobaric of weight 
inx, isobaric of weight VA^iny, and hence isobaric of weight 
in X and y together, 
e. g. Multiply by itself : 


(x* X* y *-*- . . . y/"‘) 




(r^ 


ti 


* s 




>/ 2 . 
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each term being of weight 4 in *, 
It is possible then to write 

, n, n. 77, 


2 in y, and 6 in x and y together. 


where f stands for a rational, integral, algebraic function of the 
sums (or product moments) a./, /-A---, ^1, J*^, • * • 
isobaric as explained above. Suppose that a new pair of variates 
introduced. Obviously s^y becomes Sij 
Hence applying Taylor’s Theorem f becomes 

f* Ufid„ ^ ^ 


+ 




aU other terms being idehdcally zero, where d^y • and ctjy 

■ d^/da^/, iJ,. , M/> , y«y, • • Wg. ^ 

Using the multinomial theorem and collecting coefficients of «c- 
the above expression may be written in the form 




where 


0 ) 


C Du m clff , 

r D,% •dig, 
iDt, -dg, 


XD. 


7J 




■ <<*z * Clfl/2, 
L Dg! « dg! , 

A# » d,g, 
d. 


As 




Ak ■ <=<« + d,; dg,. 

A/ 


dgfts 
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+ <^tl ^13 YZ, 

^33 * ^53 ^ '' 

'*' ^S! '*' '^»l Z^' 

.^S! ' 


K ’2 


d"'- 




'■‘■aJ. 


a ->3 


K! 


KJ 


Xa( 






where X, i, + ^ 2^3 * • ’ • • • * ^ , 

\J,*X 2 jz->-Xj,-. “j, 

h.iz.is. .y/Jj:Js'‘-‘'X't>X'i.Xj,--htmg positive in- 

tegers. The inverse relation is given bv 




kj k^t 


Xjl 


where i- /,'•••, iv, , j^/, ^ , 

kiJi^k^Jg *k3/3+ =y, 

k, ■*■ k^ K 


^'/ » jg > . • • •. Z>. kg, kj,--- being positive in- 

tegers. 

The effect of the new variates of ,^on 

Ca7' •... b,^> . . . ; 

is to replace this symmetric function by 

(a^’ ag ^3/'* . - . bg^* 

+ . . . ) 

V.7 . . b,^'bg ... ;+ ... 

Hence rei^acing f hy (a7'cig”»a7 ^ * • • 4 ^Ag^* .. .^, then 

.(1* c^/S£i, r *^*D,g r ^ 3 .... 

* * > • • yya.^'a/^a/' • • - 6^' b^*'bg^* ■■) 



A. L. (yrOOLB 




. . . . ) 

(, 

. .br” b”'-' b,'"’ ■ ■■}*■■ ■, 

£)qua.ting coefficients of like terms in andydfit is seen that 

‘(a7\ • • A ■■■I 

' • ■ br'br^b ;^*- . . ; 

(A 

' ^(ar'c(,^ar'- ■ ■ brb7^>''br* ■ ■ ■'>, 

etc. and 

D^^far%^‘a7» ■ ■ - b,^‘4 . ..hO if both 

/? and/rarenot among <*/, <* 4 . a^, • • • and b, b^^pg, • • • 
IrespeCtive ly. Hence also 

■ ■ <**/», w'. . trbr’t^-)^ 

if tf.tgftj ,'- : -.f^- are not alt respectively less than or equal to 
Vf , r7g, > 7 j, • ■ V and also respectively less than or equal to 777,, 
THg, mg, • • ; 7 qjwhere t,, tg_, tg, ’ ■ ' ‘,tff are positive integers or 
zeros ; that is, if even one of the values of t is greater than the cor* 
responding 77 mm then the effect of the operation is to give zero, 
this last property of the D operators is important not only in 
minimizing the work of computation as will be Seen in the illustra* 
tions given below, but will be of fundamental importance in the 
theorem to be stated in the closing paragraph of this paper. It 
should be noted here that the multiplication of the operators is com* 
nmtative. 

To illustrate the application of the operators consider '. 
^YCZlJ.*)mCiSgAt where c, and c, are constants to be 

.determined. These are the only terms which satisfy the weight 
conditions. Operating on the left with and on the right with 
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gives (1.1) ^C/S,,, i.e. s„,c,s,„hQnct c, = 1 . 

Operating on the left with and on the right with d^+dud^, 
gives Ox c, hence c^x-t and thus^^^y./'^-s^, s„ - . 

b) ^ 5 

^4/ ®// * ^3/ ^g/^u %• 

These are the only terms which satisfy the weight conditions. If 
the expressions for all the symmetric functions of lower weights 
are known then to determine the constants it is sufficient to operate 
on the left with 


,I\j and on the right with theii 
respective equivalents in terms of d. However if the expressions 
for the symmetric functions of lower weights are not known then 
it is perhaps simpler to operate with 


^/g 




^ at ^g! > 


^3/ *^3g I ' ^4g ^g/ 


d„d. 


"^33 ’ '^63 on the right and with their respective equivalents in 
terms of i? on the left. and give and hence 


CjxO. 4fandj9^^ and and/l^^ give 


0x6cgs^j andcr.*0. Similarly 3^^ < 2 ^^ a), and 
give «/. dj, c(,^ and q, ) give c, - - Y . d^, d^^^and 

Cg*0. dg^,d^g^axiADgi(qj,“jqiDii~D*i /Z) 
give C^-ld,, d^g and Z)^ D^, } give c, - - i; 


inland y^3 

f2Dg,Dg,JD,^4DlM give c,xZ. Hence (JZlJ^) - 


%Z ®// " ^4i ^g/ '^3g ^3! * '^31 ^g/ * 

Suppose that y^«i , ^'^.<2', • •••, n. Then s/y is simply #/ 
and Dij,d[j have no meaning except when i’j and then i>// and 
dii become the operators i?/ and respectively, of the earlier 
paper. 

The operator relations for any number of sets of correspond- 
ing variates are now obvious. For instance, in the case of 3 sets 
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*i the result is 



/ * * ' 

,Wi, 

k*l,Z, ■ • • • , 


h,i,*hgiji*hgij + ‘i. 

“j’ 

h, k, *h;,k^* ^$^3* — ® 

‘■hh> ‘'3."-‘ JfJ, ./j .• • ■> • ^Z‘ ^3f-' 

being positive integers , u/(,u^,w^the weights inyy» respective- 


ly of the symmetric function , 

. - / - Z i J k 

^ijk ' ^ijk ’ 

Returning now to the case of two variates x> andy, suppose 
that takes on only integral values for . . , , ,77and that 

yi n f(xi)3i\K,K f(xi) is thought of as the frequency cor- 
responding to a: » XI If, further, b,^b^»btg~ • • - -ai 
then the operators developed in this note give the expressions for 
• )of the earlier paper when each serial X^-^is re- 
placed by generally, let y/ • represent 

the frequency of * in the interval . If x takes on only in- 
tegral values then of course - 1 , t - 1,2, 

Consider 


(32.11)«£ Xi Xj f(xi)f( Xj)Ax^ A Xj , i^j , 

, ^ I ' 

3g xff(Xi)Ax.. xff(xj)Axj xff^(Xi)£%l 

If the lower and upper bounds for X are respectively a and b then 
in the limit as w becomes infinite and the maximum Axi, inl,Z,'\n, 
approaches zero, the last summation on the right approaches zero, 
being an ordin ary frequency function. Thus in this limitinor 
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( 32 . 11 ) = 


b 

xi x^j f( %i) f( Xj)dxi dxj , i^J, 


'a ''a 




In general, under these limiting conditions, any summation ^ 
(xjAxi approaches zero, if r is greater than 1, k and r being 
positive integers. For let be the maximum A *:/ for specified 
77. Now x1f''fXi)~Mlorisl,2, , , , , ,n Hence 


which approaches zero with . This establishes the well known 
statement that, the values of x being independent, 








For, under the above limiting conditions, all those terms which 
contain a ^\mafji^sJ^xl'f^fXi)Axi with A” greater than 1 must 
vanish. By the last property of the D operators given in (2) it is 
seen that there is always only one term which does not contain such 
an and from this term arises the product of the definite in- 
tegrals. 



A GENERALIZED ERROR FUNCTION* 

By 

AwBRT WeRJHfilMER 

I. INTRODUCTION 

Given a set of observed values // ( /' * 1, 2, 3, . . . , 77 .) 
obtained from 77 observations assumed to be made on the same 
quantity, // under the same conditions. We seek to determine 
two functions ffP, jand <PfP, f/.)such that 

i(P,il)^0, ( r%l,2,3, . . . , 77 ) 

defines^ as a unique value assigned to the observed quantity; and 
<P(P to within infinitesimals of higher order the 

probability that if another observation is made, the observed value 
• will lie in the interval 

iti. 

Gauss determined the <p function to be the so-called normal 
error law namely, 

on the basis of the following assumptions, 

(a) The product is to be a maximum with respect to p. 

Thus 

♦Presented to the American Mathematical Society, December 28, 1931. 
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(b) The unique value p is the arithmetic mean of the observa- 
tions. Thus 

(c) The probability function is a function of (P~ i^i)- Thus 

<p{P. 


Poincare} on the basis of the first two assumptions only ob- 
tained the error function 


where 


<p(P,-ti)-9r^i)e 


W(P}-h \ACP) 

» 


dW 

dP 


^■P 


oLV 

dp 


=a 


In this paper we assume the unique value to be defined by 
a function satisfying certain conditions, and obtain on the basis of 
assumption (a) a more general error function ftom which the so- 
called normal error law, the Poincare function, and other forms 
of the error function as well as the Pearson curves are obtained 
as special cases. 


2. The unique value p. 

We now make the following assumptions : 

I- The unique value /o is defined explicitly as a function of the 
observed values in the region ai Thus 

where F is single valued, continuous with continuous deriva- 
tives up to the second order. 

lit The value of/o is independent of the order in which the obser- 
vations are obtained. Thus is a symmetric function. 

IIP The change in p due to a change in one of the observed valueii 
say fi ,ia& function ofp and 4 only. Thus 

0, • 0, 


'H. Poincare, Calcit! des Probobilittt (1912), p. 171. 
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IV' If p is regarded as a function of a single variable, say 4 . 
while all the others are regarded as constants, then with re- 
spect to this variable /t? is a monotonic function and is not 
constant in any portion of the interval in which it is defined. 
Thus 



for all i*s. 

We have then for the determination of the <P function the 
two equations 

0) ' 0 , 

( 1 ) 

which must be simultaneously satisfied for any set of values in 
the region defined. 


3 . The g fmciion. 

We will now show by means of the following theorems that 
if A" satisfies the given conditions, then there exists a unique func- 
tioa //^such that the equation 

is identical with equation (2). 


9 


THBORUML 


Given a function of rf variables, 



continuous with continuous nOa-vanishing first derivatives in the 
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region defined, such that , 

then }Jr^(F, x') must be in the form of a product of a function 
of F and a function of -ti . Thus 


Proof : 

We have 

(P'> ^ ^ 

and 


Hence: 

xV tJ(F,x'>) 
V/^ ^UFxn 




= n (n 


Integrating, we get 

^oglJf^(F,x‘)=jT^(FjdF-i-(''(x‘l 


from which it follows that 

V(F,x')^co(F)/3'{x^), 

THEOREM n. 
Given a -function of v variables, 

Frx'xfxf.. .,x"X 


continuous with continuous non-vanishing first derivatives in the 
region defined, then in order that there shall exist a unique func- 
tion (F) such that 
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it is necessary and sufficient that 


Proof : 


Necesiory conditions ; — 

If the ^ function exists then the functional matrix 




ula • • 

n 




■ • 


must be of rank one. Hence 





ui.1 / • . . 


Sufficient conditions : — 

We assume that 


- ^rxO’C^YxO. 

■ ' rjcJ 

Then we have the following identities: 

a* 


a) 

t) 






fit, fit , 

for^, (</ or y and 


■ i^ii ''t) • Pi \ Pi,t Pi - I’m Ph ), 
where for convenience of notation, 

/ 


etc. 
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Making use of a), b), and c), it is easily sliown that the 
functional matrix 


h 



F, 

is of rank one. It follows that 

fi'ij 


F 


d) 


FiFj 


)^(Pl 


Now the differential equation that defines the Z function is 


or 



^ 4-^ (/?) from d). 


Hence f {F) is uniquely determined, namely, 

where F and H are constants of integration. 

Now, for our problem, if F satisfies the given conditions, 
we can apply the two theorems in succession and we have that 
there exists a unique function 

Z(P)^ uUfA. 


But due to the eynimetry of F all the u‘ functione will be the 
same and we have 

l(F)n udi). 

If we now define 



7a 
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we have 

4, General Error EuncHon 

We may now write equations (1) and (2) in the form re- 
spectively I 

■^■{og(pfp,-ti)^0, 

if 

These equations must be simultaneously satisfied for an arbitrary 
set of values in the region defined. It follows that they are 
identical. Thus 

fp iog (PCp, ) » lkCp)g(p, ii), 

where \jf {p) is an arbitrary function. 

Integrating, we get 

( 3 ) 

where 0^ f/) is an arbitrary function. This is our general error 
function. In order to insure a maximum we must have 

(4) ^('p)gp * O. 

S, A Generalieed Normal Eunction 
If we now make the additional assumption that 



A GENERALIZED ERROR PUNCTION 


n 


Expanding and simplifying, we get 

9p 

Differentiating with respect to 4 . we get 

Integrating and substituting in (3), we get 
ce 

From (4) we have 

K9p>o, 

Hence 

( 5 ) (p(g)^ce-^"^\ 

We shall refer to this function as the “Generalized Nornml Error 
Function”. 


5. Application to Special Cases 
If p is defined as the arithmetic mean, then the region con- 
sidered is - 03 < < + <w>, and 

g fp. 

The normal law is obtained directly from (5), and from (4) we 
have 


0(P. 
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where 


dV 

dp dp 


■=o, 


which is the same as the Poincare' function, 

For the geometric mean, the region considered is 

0 < 


and 


gCpJO^^oq p- fogfi-. 


Hence, from (3) 

0(p, fi)-6({0e 

0 

0\iog p- fog lii^ce 

The Geometric mean as the most probable value, as well as its 
generalized normal curve are used for certain astronomical photo- 
metric measurements.' 

For the harmonic mean, the region considered is 
(d < -f I ■<. oa) 


and 

g(p. 

Then from (3), we have , > 

' and from (4), we have . * 

7. Retnarks About the Generalised Normal Curves 
Let us consider briefly some characteristics of the gener|dized 
normal cums corresponding to the following three special cases. 


» Whittaker & Robinson, Cafculns oi observations (1924), p, 21& 



^ gbneralizbd error function 
(aj Arithmetic mean: Here 

From this equation we see that 




^(p.P^e^)‘(pCp^p.eV, 
0(pj) ’Wp^e^ ^^6V, 
CpfpO) 

» 

0(p^ os) = O. 

(b) Harmonic man: in this case 

(PfpJ) ^ 

from which we see that 

(J>(pp - <P( p, pt 6^) 


(Pfp a 

< (P(p^e^ 

<Pfp OJ 


(PCp,<») 

-A* 

® ce p* 

(c) Gemttric Mean: Here 

4(p.t) 


and 




< ff A eV, 

(pfp, 0) 



a 


Instead of treating thete itortmi fttuctioai as three dittittct 
error laws referred to the mms. uwasudng sode, we otn r^rd 
them as a single error law with referiwe to three diflfereot aMattw* 
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ing scales (seelsketch). This viewpoint helps to explain the above 
mentioned characteristics of these laws. 

The law for the arithmetic mean applies when an object is 
measured with a uniformly graduated scale inp. The char- 
acteristics for this law follow directly from the consideration that 
the scale is everywhere the same. 

The law for the harmonic mean holds when an object is 
measured with a reciprocally graduated scale, as for instance 
measuring the volume of a gas with a pressure gauge graduated 
for volume. In this case the scale becomes crowded as p in- 
creases, and hence 

and also 

0(p, p*€^) > <P(p,p- 

For large values of p it would take only a small error in the read- 
ing of the scale to make an infinitely large error in the value of p 
and hence (P(p, «a) does not necessarily vanish. On the other 
hand the zero point is at an infinite distance and hence (p(p,0)=‘0. 

The law for the geometric mean holds for measuring objects 
with a logarithmically graduated scale. The same remarks as for 
the harmonic mean apply here, except that in this case it would 
take an infinitely large error in the reading of the scale to make 
an infinitely large error in the value of p . Hence 0(p, a>)«0. 

8. The .Pearson Curves 

Leaving out the subscripts in (3), we have for the general 
error function 
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Remembering that 

we have 

d(p 


75 


di 


t)fffp) cfp J , 


Thus f or a given p the curve approaches the f axis asymptotically 
Let us now impose the condition that 

d(Pi 


di 




’O, 


then 


Integrating, we get 


6f{) * CA 


'fufOVHe)df]d^ 


so that 

(6) (p(p, i)^cA / )dp 

and 

d<P . rfi 

where <■ is a variable of int«fali«. If wc now ttfct i* § fpetJit 

case 



% 
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we have 


^ bo* bj * 


which is the differential equation defining the Pearson system of 
frequency curves. For this case, (6) reduces to 




<^pj)^ce 


from which we see that by a proper choice of (f ip) we can 
choose the value of p for which the product X( (p(p,-{i) 
shall be a maximum. 

It may be noted that the differential equation defining the 
Pearson curves is often derived on the basis of the assumptions 
that the curve shall approach the t axis asymptotically, and have 
only one maximum point. 

In conclusion, it appears that if we restrict the function that 
defines to satisfy the assumptions given in this paper, and also 
impose the condition that/? shall be the most probable value in the 
sense, that the product 

fr4>(p,{i) 

shall ha a imlmum with respect toytt, then (3) is the most general 
form of the error function. 




Harmonic Mean-— Reciprocal Scale 
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A COEFFICIENT OF LINEAR CORRELATION 
BASED ON THE METHOD OF LEAST SQUARES 
AND THE LINE OF BEST FIT. 

' By J. B, CoiEman 

Given N points in a plane, corresponding to N pairs of values 
for two variables, X and Y , we find the line of best fit and 
the line of worst fit, by the method of least squares*. Then we 
derive a cofficient of correlation based on the sum of the squares 
of the distances of the points from these two lines. 

The line of Iiest fit is in the line such that the sum of the 
squares of the distances of the points from it is a minimum. The 
line of worst fit is the one from which the sum of the squares of 
the distances of the points is a maximum. We shall refer to them, 
respectively, as the minmum and imxmum lines. 

For convenience we take the origin at the centre of gravity 
of the points, letting x and y denote deviations of X and 
Y , respectively, from their arithmetic means. 

1. The two lines pass thru the arithmetic means of the X*3 , 

and of the Y’s . 

y a n7if + b may represent any line of the plane. The dis*- , 
tance, di , of a point, ( , from the line is 

y . - mxi - b * 

squares of the distances, of 

the N points from the line will be 


*For a general discussion of this method of fitting when 3 vara- 
bles are involved, see, Pearson, fCarl, “On Lines and Planes of Closest Fit 
to Systems of Points in Space”, Phil. Mag., 6th series, vol. ii, 1901, P. SS9. 
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'( 1 ) 




Nb^~ EmL^y- ZbZy * 2mbE x 


Using 0 ^ in the condition for tnaximum or mini- 

mum values in (1), the condition reduces to , and the 
theorem follows. 

.2. To find the slopes of the two lines, 

Kquation (1) now becomes 


(2) Zd^ 

The condition under which (2) will have a maximum or mini- 
mum values, is that 

m^Exy *m(Z x^-Zy V-Zxy=0, 

This condition is satisfied by two values of tv , namely; 


<3) 


1 


rr?, 


Zy‘-£x% urixy)’ 


f 


1 ( 4 ) 


j(Ex^-Zy*)*^ 4(Zxyy 

2Zxy 


It is found by considering the second derivative that (4) is the 
condition under which Ld* will have a tnaximum value, and 
(3) is the condition for a minimum value. 
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The equation of the mimmum line is y=Tn,x,, and that 
of the maximum line is The value of n), and 

777^ are those given in (3) and (4). 

3. The maximum and minimum lines are perpendicular to each 
other. 

That m, = is easily shown from (3) and (4). 

Further tt?, has the same sign as T.%y , and rrig^ , the 
opposite sign, since their numerators are, respectively, positive and 
, negative. 

4. The minimum line lies between the two lines of regression, of 
coincides with them. 

If Zlxy 0, 

^ ~ ZZxy 


since Ey^ = (Exy)^. 
Hence rn, iZ y®/ 


gression of a; on y . 

Rationalizing the numerator of (3), and noting thatZ'j:*.ZJy*:r 
(Zxy)^ , we obtain 




ZZxy 


Zky 


Zx^-Ly^^JrFxJ^Zy^?T4Z>jLyi 




the slope of the line of regression of y on x: . 

In the same way it may be shown that if Exy< O , then ?77 ^ 
777 /= ^yx- 

The condition that be equal' to the slope of one line of 
regression is the same that it be equal to the other, so that the 
minimum line coincides with both lines of regression, or else lies 
between the two. 
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5. To find the sum of the squares of the distances of the points 
from the minimum line ; also from the maximum line. 

Let £ 2 !' be the distance of a point from the line y* 77?^ x 


Substituting for m from (3) and reducing, we obtain 

(S) 

Lci^^]fx^4Zy^- fZl 4(Zxy) ^ ] A 


Similarly; if D represents the distance of a point from the 
line y = 777^1^ , we obtain 




6. To find a coefficient of linear correlation. 

Let q^JZd*‘/ZD^ Substituting from (5) and (6), 
and reducing, we obtain 


(7a) 


Lx^ty^- jCZx^-Zy^)^-^ 


or 


(7W q 


. (ZxyY 


Ex^*Zy’^■^ gLii^-Zy^)^+4(£xy)^ 


q represents the ratio of the root-mean-squares of the distances 
of the point from the minimum and maximum lines, This ratio 
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is a measure of the closeness of fit of the points to a line, and 
should furnish a measure of linear correlation. The value of 
may vary from 0 to J. Q ~ 0 indicates that the points are 
all on a straight line, hence that the correlation is perfect. It is 
of interest to note that when * <9 , (7b) gives T,xy/Ncf^^ 
[=r] = i: i When < 2 ' is 1 the mean squares of the distances 
of the points from two lines at right angles is the same, and linear 
correlation is lacking. Hence 1 -q would furnish a coefli* 
cient conforming to the customs that it have the value I for 
perfect correlation, and 0 for lack of correlation. 

Values of q found from (7a) or (7b) would involve the 
units in which X and 'Y are given. Hence these forms would 
be objectionable, in that q could be made to assume different 
values for the same data, by changing the units in which x- and 
y are expressed. However, this objection may be removed by 
taking and cTy as units in which to express X and . 
(7b) then reduces to 

N*\Zxy/a^cy^\ ' 

The coefficient 1- q may now be expressed as 


( 8 ) 


Ia/- \Zxy/<^^ <^y I 


The sign of the coefficient should agree with that of the slope 
of the line to which the points are fitted. Hence, when the value 
for q has been found it should be given the same sign! as 
the slope of the minimum line. But the slope of the minimum 
line is determined by that of ' Z xy , so that the sign given tq 
l~q should be that of 71 xy. 
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The coefficient I-q may be expressed immediately in terms 
of the Pearson coefficient, r , which is equal to Il.xy/Ncf^ d^. 
Making this substitution in (8) we have 


-y- 



In the table are shown values oi l~q corresponding to some 
given values of r . The values for 1-q^ have also been listed 
corresponding to the same set of values for r . The maximum 
difference occurs between l~q and r when ^“.839 and 
704, a difference of .135 by which 1-q is smaller. 


r 

rc-l-q 

1-q^ 

1 

1 

1 

.99 

.929 

.995 

.95 

.840 

.974 

.9 

.771 

.947 

.839 

.704 

.912 

•8 

.667 

.889 

.7 

.580 

.824 

,6 

.500 

.750 

.5 

.423 

.667 

.4 

.345 

.571 

.3 

.266 

.462 

.2 

.183 

.333 

.1 

.095 

.182 

0 

0 

0 



/. 8 , COinUM 


85 





A STUDY OF THE DISTRIBUTION OF MEANS ES- 
TIMATED FROM SMALL SAMPLES BY THE 
METHOD OF MAXIMUM LIKELIHOOD 
FOR PEARSON’S TYPE II CURVE 

By John L. Cajuson 

The object of this paper is to study the distribution of esti^ 
mates of the parameter of location for Pearson’s Type II Curve, 
estimated by the method of maximum likelihood from small 
samples. 

R. A. Fisher has assumed/ and Professor Hotdling has 
proved® that in large categories of cases the distribution of an 
optimum statistic approaches normality as the sample size in- 
creases. This normality has been assumed to hold for optimum 
statistics in general whether calculated from large samples or 
small ones, and it has also been assumed that optimum statistics 
have minimum variance and always give better fits than do sta- 
tistics calculated by the method of moments. That this is the case 
whenever the sample is large and the distribution of optimum 
statistics normal is made plausible by the reasoning of R. A. 
Fisher.* In case the sample is" small, however, there may be reason 
to doubt that the normality of distribution of optimum statistics 
holds, and that the other conclusions hold. It is with this phase of 
the subject that we shall be concerned in what follows. 

Before entering into the topic under discussion it will be coti- 
vehient to review some of the more elementary facts regarding the 
curve with which we are to be concerned. We shall take first the 
general equation for the curve in the form. 







Foundations of Theoretical Statistics, R. A, 
Series A, Vol. 222, 1922. Pp. 309-3158. 
ti ij TT ^ Ultimate Distribution of Optimum Statistics,. 
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and determine the effect of variation of the constants, 

When the equation reduces to the straight line y=^ 

When = / 1 the equation is that of a parabola with 
y=y^ at the point = w and the x intercepts at the points 
X = t a . It of course meets the x axis at an angle. 

, When p= V- 2 the equation y=0 is of the fourth degree 
in X with double roots at the points x = t a. 

In general when p^n the equation y=0 is of degree 
Zr\\tix and has two sets of :?7-fold multiple roots fa. 

Since our curve is to be a probability curve it will of neces- 
sity have unit area, and this fact makes it possible for us to 
evaluate in terms of the parameters a aiid p . In order to 
do this we shall perform the integration below. 

t-a 

ydX‘ 


Area = i 


'x-a 



I 





whence 


but now since 


( 2 ) 


I: 


cos ^ Odd 




we have 

r( 


„ .JlLeiiL- 

° affrfpfj) 


and so 



DISTRIBUTION OP MEANS 


tS8 


Therefore 


( 3 ) 


ajff r(ptl) 





P 


Formula (2) we shall find to be of value a numljer of times 
and formula (3) is the form in which equation (1) will be used 
throughout the remainder of the pai^er. 

It will be worth while now to consider the likelihood function 
L together with its first and second partial derivatives with re- 
spect to 777 . (We shall hereafter refer to tv as the parameter 
of location, a as the parameter of scaling, and p as the para- 
meter of shai)e.) We are to use tv to denote the estimate of tv 
obtained by the method of maximum likelihood, in accordatice 
with the convention introduced l)y Fisher,’ and it will be with this 
parameter that we shall concern ourselves in this investigation. 

We have from (3) on the precedii% page 






and so 


( 5 ) 


- ipl 


xi-m 




t'./ 



>Ibid Fp. 309-m 
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At tin's ]ioint we shall stop to consider the effect of variation 
of the i)arameters a and p upon our estimate of m. Lict us 
first consider p . Since the method of maximum likelihood is 
here merely the method of the diffeiential calculus it follows from 
a consideration of equation (5) that our estimate of in will be 
independent of/0 . for any particular sample. Such is not the 
case when we consider a . however, for any change in a allows 
a change in the variance of yn for the particular sample. 

We shall find it advantageous to cover as much of the the- 
oretical work as possible liefore eml)arking upon our experimental 
check and its great amount of numeral calculations, for it is only 
by means of a check between theory and experiment that we are 
able in the present state of knowledge, to judge of the applicabil- 
ity of the method of maximum likelihood to small samples. Of 
course it will be necessary to consider the distribution of our 
estimates of W . in order to make this statistic of practical use, 
and it is desirable to know the theoretical variances of x the 
arithmetic mean, in, and the experimentally obtained variance of 
the distribution of our estimates of w . The first of these we can 
obtain from theory, the second from an approximation valid only 
in the limit, and the last by means of calculation based on actual 
sampling. 

We shall be concerned first with the theoretical variance of 
X . 'It is well known that the variance of ^ is equal to the 
variance of the distribution divided by n . We must first, there- 
fore, find the variance of the distribution. 

From (3) pageSSiit follows that 


cr 









airrrCp:^/) 



ap-t'i 

e 


-cos Q\ciO 
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remembering now (2) page 87 

^ ■ ^p^■7 
whence it follows that 


( 7 ) ' 


We shall now calculate the limiting form of the variance of 
777 . Fisher has proved*^ that if the distribution of optimum sta* 
tistics is normal the variance of an optimum statistic is equal to 
the negative reciprocal of the mathematical expectation of the 
second partial derivative of the logarithm of the likelihood with 
respect to the parameter in question. 

We may write, ^erefore 
' 1 ^ ~4r)prfpf^) 
d-% <%yffr rC/o+i) 



<4, 


- 4 nprfp+i) 




_ „ . , , / CZcos'^^ %- cas 

r{p-h I ) y 

and So again referring to (2) page 87 we have 


whence 


cr%j a^fp-l) 
hence we have- 


( 8 ) 


z 

m 




^l£zlL 


rip(Zp^l^ , 

The efficiency of the mean is then 

(pdKiPiIl, ^ 


pC2p^l) 

The next problem with which we ihust concern ourselves is' 
tliat of experimental verification of the assumptions under discus* 


‘Ibid Pd. 327.328 
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sion. The problem is briefly that of choosing a number of small 
samples from a population which obeys our law .of frequency, 
estimating 777 for these samples, and then calculating the variance 
for the distribution. Also we shall draw an histogram of the dis- 
tribution and observe the general type of the distribution, so near- 
ly as that is possible from our samples. 

The problem of choosing our samples is not the least of our 
difficulties, for we can not take all types of samples. They must 
be of a very special nature : they must be from a population of the 
Type II. In order to accomplish this it will be necessary to have 
a table of areas corresponding to given values of x. for the Type 
II Curve. There is no such table available to the knowledge of 
the writer, and it is therefore necessary to construct the table be- 
fore we can precede with the choosing of the samples. After the 
table has been built, we can with the aid of Tippett^s Tables, ‘ 
choose our samples with ease. The manner of choosing is as fol- 
lows. Take the numbers from Tippett’s Tables as areas under 
the Type II Curve and look up in the table of areas the values of 
X corresponding to the smallest area containing the area found 
from the Random Numbers. This will give the value of x to be 
taken. Since we will take four digits let the fifth digit determine 
the sign. If it is odd take the sign - , if it is even take the sign v- . 

There are two ways in which a table of this nature can be 
prepared, and the method employed must in any case be de- 
termined by the degree of accuracy attainable and the amount of 
labor involved. One of these methods is that of the calculus of 
finite differences, determining the zero order differences by means 
of algebra, and from these by the process of addition building up 
the table. This method is best used when a dependable listing 
adding machine is at hand. The other method is that of direct 
integration, and it is found that with the aid of two calculating 
machines this is by far the quicker. It was this method that was 
applied in the building of the table on page 92 and 93. 


‘Tracts For Computers, No. XV. 
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Table of Areas Under Pearson’s Type H Curve, Correct t'o 
9 Places of Decimals. The Areas are included between ordinates 
located - ;«! units from the parameter of location . 

Constants, <3'/ 


X 

wmmi 

X 

Area 

0.00 

0.000 000 000 

30 

529 661 250 

01 

018 748 750 

31 

545 084 843 

02 

037 490 001 

32 

560 298 291 

03 

056 212 259 

33 

575 295 077 

04 

074 920 038 

34 

590 073 828 

05 

093 593 867 

35 

604 625 820 

06 

112 230 292 

36 

618 947 482 

07 

130 821 880 

37 

633 '034 148 

08 

149 361 229 

38 

646 881 319 

09 

167 840 964 

^9 

660 484 657 

10 

■riii'wrf 1 

40 

673 840 000 

11 

204 592 289 

41 

686 943 3Sa 

12 

222 849 331' 

42 

699 790 921 

13 

241 017 673 

43 

712 379 067 

14 

259 090 168 

44 

724 704 358 

15 

277 059 727 

45 

73^*763 555 

16 

294 919 322 

46 

748 553 612 

17 

3l2 661 995 

47 

760 071 688 

18 

330 280 859 

48 

773 151 488 

19 

347 769 104 

49 

782 281 572 

20 

365 120 040 

IHHESH 

792 968 750 

21 

382 326 904 

51 

803 374 696 

22 

399 383 261 

52 

813 497 651 

23 

416 282 613 

53 

823 336 081 

24 

433.018 598 

54 

832 888 688 

25 

449 584 961 

55 

842 154 414 

26 

465 975 552 

56 

851 132 442 

27 

482 184 334 

57 

860 009 702 

28 

498 205 389 

58 

868 223 379 

29 

514 032 918 

, 59 

876 335 911 

30 


ar 
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61 

892 546 290 


81 ■ 

985 203 165 

62 

898 944 981 


82 

987 317 441 

63 

905 907 620 


83 

989 230 274 

64 • 

912 585 318 


84 

990 949 478 

65 

919 120 273 


85 

992 483 242 

66 

925 092 472 


86 

993 840 132 

67 

930 925 941 


87 

995 029 095 

68 

936 482 509 


88 

996 059 469 

69 

941 764 926 


89 

996 940 979 


946 776 250 


90 

997 683 750 

71 

951 519 851 


91 

998 298 304 

72 

955 999 411 


92 

998 795 571 

73 

960 294 310 


93 

999 186 888 

74 

964 182 748 


94 

999 484 008 

75 

967 895 508 


95 

999 699 102 

76 

971 362 202 


96 

999 844 762 

77 , 

974 588 156 


97 

999 934 010 

78 

977 579 039 


98 

999 980 299 

79 

980 340 865 


99 

999 997 519 

86 


J 

100 

1.000 000 000 


The table on the preceding page was built by direct integration 
of (3) page 88, with and a*/ These values for the 

parameters were chosen so as to save as much labor in calculation 
as possible, and at the same tinie maintain the desired shape of the 
curve. There has of course been no less Of generality in setting 
a»i but we have limited ourselves quite definitely in using the 
value ps2 . The accuracy of the table to 9 places' of decimals 
has been assured by calculating all values to 13 places and then 
determining the maximum error over the whole range which was 
625 in the 13 place due to the use of the decimal equivalent of 
in the third degree term. 

Our table of areas being complete and the problem of sampling 
thus solved, we must next consider the task of estimating tv. 
Since we have already taken a - J and p -2 in building our 
tables we shall continue to use these values throughout the work. 
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distribution op means 


The next question to Ijc settled before beginning our work is the 
size of the samples with which we are to deal. The case 77 = 1 is 
of course of no interest for the best estimate of a single observa- 
tion is the observation itself. The case « 2 is likewise of no 
interest for in this case the arithmetic mean coincides with the solu- 
tion by the method of maximum likelihood. Therefore it is the 
case '77-3 with which we shall be concerned. Our results are 
not then trivial, and at the same time we have the case which is the 
easiest to deal with, since the number of numerical calculations for 
each sample is reduced to the minimum. 

Before going ahead with any attempt at solving an equation 
such as (S) pageBfi., it is well to have in mind a picture of what we 
are actually trying to accomplish. With such a picture in mind we 
are better able to realize the difficulties of the situation and so are 
lietter able to cope with them. To this end we have included a 
graph of the problem inyolved which will make clear at a glance 
just what must be clone to find the true value of m. 

We have drawn. i)age 96 , from ijlotted points the curves 

representing L , and the three terms of , Also we 

have drawn in the asymptotes to the curve which are of significance. 

If we are able to find the point at which we have the 

solution to our problem. This involves solving a fifth 'degree equa- 
tion. We shall use Newton’s method of successive approximation. 
Fisher states^ that in some cases at least, we may start with an in- 
efficient statistic and by a single approximation obtain an efficient 
one. Whether or not’ this is the case for small samples we shall 
see when our calculations have been analyzed. 

It will be seen upon examining the graph that as is allowed 

to vary each of the therms of ^ varies from -otf to oe , 

V 1 Statistical Estimation, R. A.. Fisher, Proc. Cam. Phil. Soc. 

Vol. XXn part S. Pp. 5!08-709. 
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and so their sum also varies ktween these limits. It will be seen 
that the asymptote corresiwnding to the largest allowable value of 
wean be found by adding the value of a to the smallest observation, 
and that the asymptote corresponding to the smallest' allowable 
value of m may be found by subtracting the value of a from the 
greatest ‘Oljscrvation. It is thus evident that as dispersion in the 
sample increases the variance of m must surely decrease. That this 
fact is of fundamental importance in choosing our first estimate 
of m will be seen from consideration of the following case, since 
it is well known that in the case of a curve with a real finite pole 
Newton’s method may lead us to erroneous results. Let us con- 
sider the sample consisting of the three observations .99, 

a 1 ^' .99, .j;j«vt.99, for which the arithmetic mean is 
£ + .33. If now we take as ouf first approximation 

we will immediately lead ourselves to an erroneous value of , 
fmr estimate of . That this is the case will be easily seen by 
means of the check given on page 94 for using £2 « 1 we locate 
the asymptotes at x - 1 .01. The true value of m located be- 
tween these asymptotes, is not therefore even to be approached 
.should we take m, = x , This is an extreme case and fortunate- 
ly not to be anticipated very often, or the arithmetic mean would 
be deprived of any value whatsoever. We should always be'sure 
that the difference between any observation and the value of m 
that we arc using is not greater than the value of a when’ dealing 
with the Type II Curve. This holds no matter what our manner 
of attack may be.> ' 

We shall now be concerned with the calculations for our 100 
samples of 3. All of the data are tabulated in such a way as to be 
self-explanatory, and- so we shall not bother to give sampld cal- 
culations. The next ten pages cover these calculations. The dis- 
dSssion is continued on page’ 104. 



Vaht€S ofL 






JOHN L. CARLSON 


97 



1 


777/ 

/ 3 



h(x,-Tn)^ 


mg 


hi Hxi-n^ 



N-L/dni^ 


n 

mT 


+ .254667 




■Mi 

1 

2 

-23 

-.0700 

- .164200 

+ .009840 

SiiKikiijI 

- 3.184900 



3' 

-IS 


- .080520 





■ 

m 








2 

2 

-.15 

+.0533 

BKI 

4^.016730 

SiemS 

- 3.-438830 

+.049737 


3 

+.34 


+ .312676 


-1.284716 

' 



■ 



+■ 554668 


iim 



3 

m 

nl 

+.4300 

- .762281 

-.177943 

-2.573777 

- S.080876 

+.46S022 ' 


i 

Swi 


+ .329670 


- 1.316266 




1 

S!9 


- .291230 


-1.248262 



4 

9 

!kCI 

-.2700 

-.060216 

+1018882 


-3.655622 

-564835 


9 



+ .370328 


-1.396495 




9 

SI 


-.481688 


-1.718S38 



S 



+.0067 

+ .104445 

-.029803 

SIMM 

- 4.101439 

+.013936 


9 



+ .347440 






■ 

WM 


-.408701 


-1.479856 



6 


SiK+l 

+.0967 

+.023312 

-.024353 

smmm 

-3.858903 

+.103011 


i 



+ .361036 


-1.377417 




m 

+.54 


+1.7704SI 


-4.424677 



-7 

2 


-.2167 

- .308021 

+.853498 


-9.978709 

-.131268 


3 

-.69 


-.609932 






1 

M9 


+ .815850 





8 

2 

-.64 

-.3700 

- .291230 

+.207992 

-1.248262 

-4.758692 

-.326292 


3 

-.66 


- .316628 


-1.292329 




1 

-.12 


+ .361036 


- 1.377433 



9 

2 

-.48 

-.4433 

- .036749 

+ .011911 


- 3.666198 



3 

-.73 


- .312376 


- 1.284716 


im 


Di 

-.49 


+ .167774 





10 

2 

-.77 

-.6533 

- .118311 

+ .002661 


- 3.132061 

-.652450 


3 

SkI 


- .046802 






r 

R9 


+ .066896 


|||nf|T|(Xi^ii 



11 

2 

9^ 

-.1367 

+ .18228S 

-.009546 


-3.308885 

-.139555 

_J 

3 

-.38 


- .258727 






m 

Mil 


+ .128735 





12 

2 

-.72 

-.5267 

- .200836 

-.005134 

-1.119217 

-3.182127 

-.528283 


3 

-.46 


+ .066967 






1 

-.30 


- .036749 





13 

2 

-.65 

-.2633 

- .454723 

-.387058 

-1.S89322 

-4.343556 

-552411 


3 

+.16 


+ .104414 






1 

-.14 


+ .128767 





14 

2 


-.2667 

- .246729 

-.010034 

-1.179312 

-3.263601 

-.269775 


3 

-.16 


+ .107928 


-1.034813 
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Sam- 

ple. 

nJ t 








dL 

3 _ 

£ x; -rr> 


HUrTv)^ 


TlHxrmf- {J'Cyii-7n)^fi 




0\ 

dwi^ 

.n /I 


Wn 


WtS 


IS 


-.09 

.20 

-.43 


.2367 


+ .149926 
+ .036749 
- .183962 


+ .002713 


- 1.066950 

- 1.004049 

- 1.106718 


- 3,177717 


h .235846 


16 


-.29 

+.12 

♦.27 


+.0333 


- .360623 
f .087663 
+ .251104 


.021856 


- 1.376579 
1,022996 
-1.185618 


- 3.585193 


+.039096 


17 


-.10 

-.39 

-.23 


-.2400 


+ .142798 
153452 
+ .010001 


-.000^53 


■ 1.060774 
■1.070113 
■1.000300 


-3.131187 


-.240209 


18 


♦-.26 

-.42 

-.17 


-.1100 


+ .428687 

- .342958 

- .060216 


+ .025513 


■1,526159 
■1.341557 
■ 1.010865 


-3.8785S1 h. 103422 


.19 


+ .23 
1-.21 
-.43 


.1367 


+ .423670 
073695 
- .320905 


+ .029076 


1.514353 

1.016264 

1.300082 


- 3.830699 


.1291 


zal 


+ .37 
-.09 
.67 


+ .3167 


+• .053552 
.483174 
+ .403835 


.025787 


1.008563 
1.673112 
. 1.468552 


- 4.15D227 


21 


1 + ,21 

76 

1+.66 


+.0367 


.178665 

-2.181131 

+ 1,019301 


- .983165 


- 1.094805 
•12.252377 

- 3.713282 


-17.060464 


- .094328 


-.20 

t.23 

-.22 


-.0633 


- .139303 
+• .320905 
,160644 


+ .020958 


■ 1.057853 
•1.300082 
• 1.076786 


- 3.434721 


-.057198 


23 


-.44 
-.62 
+ .16 


-.3000 


- .142798 
.356506 
+ ,583460 


+ .004156 


■ 1.060744 
■1.368375 
-1.949243 


- 4.378292 


-.280779 


24 


.28 
+.20 
+ .4S 


+.1233 


- .481639 
+ .077153 
+ .365736 


.038750 


-1.658197 

-1.017823 

■1.387011 


- 4.063031 


■ .132837 


25 


.13 
-.56 
I- .02 


-2367 


.107928 
.361036 
+ .408551 


+ .155433 


-1.034813 

-1.377417 

-1.152673, 


- 3.564903 


.193096 


26 


I+.IO 

I+.14 

,27 


-.0100 


+ .111347 
+ .153452 
- 278850 


- .014051 


-1,024632 

"1.070113 

-1.228016 


- 3,322761 


-.014229 


27 


.79 

-.09 


■2200 


- .603260 
+ 242846 
+ .254668 


- .105746 


-2.011377 

■1.142660 

-1.190833 


- 4.344870 


-.344338 


28 


.08 

I +.43 

+.18 


+.1760 


- .273953 
+ .271517 
+ i )04000 


+ .001564 


-1.028078 

-1216408 

-1J000048 


•3.244534 
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DlSTRIBUflON OP MEANS 
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-.0933 

-.066998 

+•'.361036 

+- .019230 

I + .0900 

-.476190 
-.030027 
+ .527542 

+ .021325 



+.085186 
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Sal 

pie 

_No 

n- 


mi-Ti 

4" 

x^'-m 

dL 


dw* 

777^ . 

1 





d^L/djtr^ 


“ 

+.36 


+ 250748 


- 1.185102 



»5 

2 

+ 36 

+.1233 

+ .250748 

- .108436 

- 1.185102 

- 3.533041 

+.153992 


3 

-.35 


- .609932 


- 1.162837 




1 

+•.56 


+1.473657 


- 6.399500 


MH 

■87 

2 

-.60 

-.1567 

- .551721 

+ .626575 

- 1.853371 

- 9.508069 



3 

-.43 


- .295361 


- 1.255198 


ilil 


ra 

-.45 


- .287122 


- 1.241454 


■ii 

88 


-.29 

-.1833 

- .107928 

+ .038693 

- 1.034813 

- 3.814450 



9 

+.19 


+ .433743 


- 1.538183 




n 


mi 



- 1.000300 


mil 

89 

9 

siEbl 


- .341851 

+ .005768 

- 1.341559 

- 3.710134 



9 


■■i 

+ .357620 


- 1.368275 


imi 


1 



+ .164203 


- 1.080198 


Bm 

90 

2 

-.44 

-.2500 

- .197115 

- .002885 

- 1.115161 

- 3.198063 



3 

-.22 


+ .030027 


- 1.002704 


mm 


1 

+.24 


cairiEstiixi 


- 1,080198 



91 

2 

E&3 

+.0800 

EKm!I!^ 

+ .001397 

- 1.001200 

- 3.142172, 

+.079555 


3 

-.06 


- .142798 


- 1.060774 




1 

+.05 

im 

+ .139303 


- 1.057853 


mi 

■92 

2 

+.12 


+ 215139 

-.034122 

- 1.137879 

- 3.632231 



3 

-.43 

mu 

- .389164 


- 1.436499 


mH 


1 

SQ 


- .189672 





93 

2 

tfixi 

+ .0833 


+ .007625 

Bliiii}: IHcl 


+.080990 


3 

+.32 


+ .250748 






1 

EE!!] 




- 1.1330SS 



■94 

2 

+.26 

+.1233 


- .005763 

- 1.057853 

- 3,204354 

+.125098 


3 

+.19 




- 1.013446 



— 

9 

ira 


- .183494 


- 1.098764 



■9S 


cXfil 

-.2233 

+ .720642 

+ 243083 

-2.415765 

- 4.961730 

-.194463 


9 





- 1.447201 





jwiji 

■m 



- 1.019406 



96 

9 

Bil(1 


- .060216 

+ .000291 

- 1.010865 

- 3.031471 



9 

Sid 

mu 



- 1.001200 




1 

-.04 


SS^?B?5E9 





97 

2 

+.28 

+.0433 

+ .250748 

+ .009877 

-U85102 

- 3.279518 

+.040288 


3 

-.11 


-.156989 






9 

-.21 


+ .097612 




mn 

98 

9 

-.18 

msm 

+ .128767 

- .018516 


- 3.240918 



9 

-.53 







ili 


9 


■m 

+ .050125 





99 


BE9 


-.050125 

+ .000000 


- 2.015062 



9 

:ES3 

mm 

+ .000000 


-0.000000 




9 

-.09 

mi 

- .030027 


- 1.032758 



100 


-06 


+.000000 

+ .000000 

-0.000000 

- 2.065516 



m 

■a03 

mu 

+.030027 


-1.032758 
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DISTRIBUTION OP MHANS 


In order to analyze the results of the calculations on the pre- 
ceding ten pages it is necessary that we find the theoretical vari- 
ance for 52 and m from the formulae derived for this purpose 
•Her in this paper. 

From (7) page 90.,we get after setting a-1, /?* 2, and r?*3, 

(10) .047619 
and from ( 8 ) page 90 after similar substitutions 

( 11 ) .033333 

From (9) page 90 we get for the efficiency of the mean when 

(12j /f=.70 

Now by actual calculation from the ungrouped data the mean 
square deviation from zero which we shall designate as 

(13) ■ 77 - - .048611 
and 

(14) ’^^*.047612 

/V 

Comparing (10) and (13) it is evident that such a difference 
can be said to be well within the limits of random sampling. The 
difference between (11) and (14) is of such magnitude that we 
can not say that it might be expected in the course of random 
sampling. Now since im^ is our estimate of m it is evident that 
either a single approximation by Newton's method is not adequate 
to give the best results or the approximation, 


2 a^(p-l) 

^ in " vp(Zp-il) ' 
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to the variance is not valid in the case oi small samples. That the 
latter seems to be the case the writer firmly believes. The reason 
for this belief lies in the fact that in a subsequent case a sample of 
3 was examined by Newton’s method, and starting with 7V, 
* 2 = -.07 the values - .066879 and - .066791 

were obtained. There is not sufficient improvement here to cause 
one to suppose that by taking a third approximation we would 
obtain a variance in keeping with the one derived from theory. 
Also considering the mean square deviation from zero for x and 
it seems that the gain in accuracy to be expected from the use 
of the method of maximum likelihood solution instead of the arith- 
metic mean is not sufficiently great, in the case of samples of three, 
to warrant the additional labor involved in calculation. We must 
be sure, however, that in using the arithmetic mean we are using 
an approximation to m which complies with the qualifications given 
on pages 95 and 96. A graph of the distribution as found from 
the calculations is given on page 106. The histogram represents 
the grouped data while the smoothed curve is a rough approxima- 
tion to the actual form of the distribution. 


Histogram Data 
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The totals have been used in the histogram which has been 
forced to k symmetrical so as to give the effect of a sample of 
twice the size. 

The tabulation of the histogram data draws attention to the 
great excess of samples having negative values for X and 77?^ . 
This has caused the writer no little concern. In examining the 
signs of the observations we note that there are 183 - and only 
117+ values. Assuming + and - values to be equally likely 
this gives a deviation from 150 of 33 or 3.81 times its standard 
error which is incredible. It seems therefore that Tippett's num- 
bers are not random in this respect, and that it perhaps would have 
been better to toss a coin to determine the signs. , 

As a final check and in an effort to place the type of the dis- 
tribution the value of 0;^ has been calculated, and found to be 
/^= 3.056 

01 was not calculated as the excess of negative signs would lead 
to an erroneous value. There is every reason to believe that fii 
should be zero. These facts suggest that the curve is very near to 
the normal curve, but perhaps slightly more leptokurtic. But, 
why, if this is the case, there is not better agreement between (11) 
gnd (14) page I04the writer is unable at the present to state. 
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DBPRnCIATION OP PHYSICAI ASSETS 


The analytical conditions that a function of ;t may haye a 
minimum are that the first derivative of the function with respect 
to be zero and that the second derivative of the function with 
respect to ;a: be positive. If, as is customary, we denote the first 
and second derivatives of U(x) by U fx) and re- 

spectively. and those of T^Cx) by 2‘(x) and JS "(^Respective- 
ly, we find on differentiating 


U(x)^ 


i 


2''(xy2u'(x) 

u (x)= * — -z — ' 


from which it is evident that the life-time of an asset must satisfy 
the two condition.s 

( 2 ) xP'fx)-C-Pfx)^0, P"fx)>0 

In short, the life-time of an asset is given by that root of 
the equation xP'(x)-Pfx)= C which will make ^"(x)>0. 

For example, let us suppose that the repair function is givem 
by the equation = ax^-f bx-t-c , Then }? '(x) =2ox + b, 

TS "(x)‘2a, and the conditions (2) reduce to 

ax^^C*C, a>0 

The life-time of the asset is therefore equal to J(C*cJ7a 
provided the coefficient a is positive. It is interesting to observe 
that is independent of the conkant b . 

3 . , In the preceding discussion no allowance was made for 
the salvage value of the asset. Let us denote the scrap-value of' 
the asset after x years by S(x), then the average yearly de- 
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predation, interest again not considered, is 

C*JPM-S(x) 


(3) 


U(x)^ 


X 


and the conditions which will make U(x)d. minimum are 

(4) P'rx)>3'?x). 

If the scrap-value is a constant, both 3 and S '(x) 
vanish, and the life-time of! the asset is determined by 


(5) xP‘Cx)~C~R{x)^S(x)=0, P"(x)->0. 

The conditions (2) indude (4) if we replace ]Pfx)'^'3 ^fx) 
S(x)^ that is, if in the outset we diminish the repair function by 
the salvage value at time x ; to include (5) it is suffident to re- 
place C ^ the original cost of the asset, by C-3^ the difference 
between original cost and scrap value. With these modifications 
we may treat (2) as representing the general case. 

4. To avoid any possible confusion, let us denote by T(x) 
the total outlay to 1)e recovered by uniform annual charges to 
production during the life-time of the asset. Taking account of 
the re.sidtial value 56i.)we see that 


(6) J{x) ^C->-R(x)^S(x), T Yx)^ e Yx)-SYx) 

and (3) and (4) take the simpler forms 

( 7 ) U(x,)=:T (x)/x, X T '(x) -T(x)=0, P "(x) >S'fx} 

From the first and second of the equations (7) follows: 

^ 8 ) . Trx)^U{x) 

which may be appropriately called the life-equation of an asset 
since its solution yields the life-time of the asset as defined in 1, 
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5. When the repair function and the salvage function are 
known, the real roots of the life-equation may be found either 
by direct methods or by methods of approximation. However, in 
the great majority of cases which occur in practice the value of 
T (xi) is given only empirically, from the recorded experience re- 
lating to the asset in question, and the data available may not lend 
itself to analytical treatment. In all such cases the life-time of 
the asset may be determined approximately by means of the fol- 
lowing simple graphic method. 



Let AB(Kig. 1) represent the graph of the equation 
y*‘T(x,) , constructed in Cartesian coordinates. We shall cal) 
it the total outlay graph, because the ordinate y of any point 
yj on this graph represents the total outlay during the period 
of M years if the asset were scrapped at the end of this period. 
The straight line, OP , joining the origin O to any point P 
onA3, we shall call the uniform charge to production graph, 
It enables us to determine at sight the aggregate amount that must 
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be charged to production during any given period of time in order 
to recover the total outlay OP for the time on the basis of 
uniform distribution over- the entire period aj . If is ex- 
pressed in years, the ordinate U V , oi the point on OP whose 
abscissa is unity, will represent the uniform charge to production 
per year, which is required to recover the total outlay for 70 
years. 

Now it is obvious that this unit charge UV will vary with 
the slope of the line OP . It will be least when the slope is least, 
that is to say, when the point P is such that the line OP is 
tangent to the total outlay graph. The abscissa, OC , of the point 
of contact, T , is then the life-time of the asset under considera- 
tion. 

To determine the life-time of an asset, interest considerations 
being disregarded, we need therefore only construct the total 
outlay graph /d S , then draw the tangent O T > and finally mea- 
sure the abscissa of the point of contact T 
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(% 2)shows the construction when T M has the forms 

mx^*25x: ^ 1000 . 

1000, and 

loss'*- 25 / 1000 

respectively. In each case the life-time is found to be 10 years, 
which verifies the theoretical conclusion of 2: that the life-time is 
independent of the coefficient of X . 

We have seen from graphical considerations that the uniform 
cliarge to production will be a minimum when its graph is tangent 
to the total outlay graph. This condition is precisely the condi- 
tion cxpiessed l)y equation (8). which asserts that when a; is 
the life-time of tlie asset TYxJ , the slope of the tangent to the 
total outlay graph, is numerically the same as the yearly charge 
to production. 

6. We now come to consider the problem of finding the life- 
time of an asset when interest at a specified rate is to be taken into 
account. In this case, the various itenw that make up the total 
outlay, as well as the component charges to production, must be. 
replaced by their present values at some arbitrarily chosen epoch, 
as say, the epoch zero. 

Let us attempt an analytical solution of the problem. LetZl t 
represent a small interval of time. The outlay during the interval 
from if" to is T (i+Ai) -T(i)- If the specified rate 

of interest is, i , and if we represent the discount factor by l/tt + 1) 
the conventional symbol \> , then the present value of 

at the ejwch O has some value between \[ f ir \/* 
and us say 

where & has some value between O and 1. The 
total outlay during the time t , evaluated for the epoch 0 , is 
^therefore 

;( 9 ) C+E\T(uAi)~r(t^y^*^^ 
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'the sum extending over all the time intervals between 0 mA t , 

Now where is the greatest 

among all the fractions & . We have, therefore, 

( 10 ) 

If the intervals A t are all equal and their number n , then 
Af’^ i/r ) , and as n is increased indefinitely A f approaches 
0 Then i id/' approaches t , and' we see from (10) that 
(9) must have the same limit as 

(11) c^ZL [rruA t) - m;] y* 

To determine this limit we write 

At 

where the first factor on the right represents the difference quotient 
which approaches T'(t) as a limit as Zl/ approaches C? as a 
limit. With this relation .introduced into (11), we obtain for the 
present value at the epoch 0 of all the increments of outlay dur- 
ing the time t , the intervals of time being infinitesimal, 


( 12 ) 


Limit 


C+f'y*- T'ftj.dt 


In a like manner we may derive an expression for D {t) , 
the limit of the sum of the present values at epoch O of all the 
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charges to production during the time t apportioned at some uni- 
form rate U to each of the intervals A i . The charge ap- 
portioned to the interval from f to i-f-Ai'is U.At, its present 
value at epoch O is U. The present value of the sum 

of these amounts for all the intervals Z) between (9 and i is 


which for infinitesimal values cA Ai has the same limit as 
Zl U, At , so that finally 

(13) D(i) ‘ Limit 21, U f dt. 

At->o i 

het U(jii) Ije the value which must be assigned to U in 
order to recover , the total outlay for ^ years through 
uniform charge to production, interest considered. Then DCx-) 
must equal T(x,) 

UMi ^ C-^J yt T'Ct). di, 


from which 


(14) 


UM- 


C-hJ v't.T'dhdi 

~w 

dA 


di 


The life-time of the asset is that value of x- in (14) which 
will make U(x) z minimum. The derivative of U(x) with 
respect to x must therefore vanish. Differentiating (14) with 
respect to and setting the result equal to (9, we find 

'(X) jV.di-\^~^ fy*T W, di^ y O. 

from which 


C*J y^~T‘(t)-d'{ 


(15) 


Tfx)^ “ 
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■which is the life-equation of the asset, interest considered. 

7. In deriving equation (13) we apportioned the charges 
to production for an interval Ai and found the sum of the pres- 
ent values at epoch 6“ . Dfi) is the limiting value of this sum 
as the intervals 4^ are indefinitely diminished. If, as is cus- 
'tomary, no charge is made to production until the end of the 
year, this single charge will be the aggregate amount of the con- 
stituent portions for the separate intervals A f , accumulated with 
interest to the end of the year. The charge for the interval from 
t to t+Ai h U Ai ,\i% amount at rate L to the end of the 
year is U (l*i)^.Ai , where t is the time to the end of the 
•year, and the equivalent single charge at the end of the year is 

0 « hmitZ U. (Ulf.At = U. 



0 


8. As an example let us again take T(t)=at 
then 

T(t)^2.d+b j » (v^-l)/log K 
^ '^6 ° 

TIM^ 

= ZQiv*-^-b(v^-i}\/ tog v-Za(v^-l)lOoqv)^ 
(17) 

l(t)^ c-f- Zatv%b(\/'^-if^Jloq ^~Zci{v^-/)/(hg 



J v^(Zd-f-b)cli 


(18) D(i).U.{^^-l)/ /ogy, 
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and (15) reduces to 

^ ^ ^ cClagvf 

(19) —Ja ' 

While the life-equation (19) cannot be solved algebraically, 
it is evident that an approximate solution for could be ob- 
tained from a list of tabulated values of the function v^-iogv^. 
When such a table is not available, an approximate solution to any 
desired degree of accuracy may be obtained as follows ; 

We may write for v.e ^ where e is the base of the 
natural system of logarithms. (19) then takes the form 

(20) . >c/ogi/= J + c(/og 

On expanding the first term of this equation into a power series in 
%, and simplifying the result, we have 

x,\>i log ^)/y -f- X ‘^dog x( fog i !') / . . • » c/a , 


whence 


c/a 

f-^xf log log v'j^jlZ y)^/60-*- • 


Now for all ordinary rates of interest log v is necessarily very 
small, so that if we denote successive approximations of x, by 
etc., 




c/a 


\l/Z 




c/a 


/■^XiCtog \x)/y 

i • 

TTj 




'3 [/4X^ dog i^)/7^x2(/og >^/ ^/// 
c/a 


p> 


//z 


hxg ( tog Y)/3 / {/eg y/ ^12 dog y6C? 


t/z 
, etc. 





Let us take the special case, prewously considered in 5., when 
C « 1000, i?=0, £2 = 10, and let the assumed rate of interest be 
6 percent. Then 

/£?^ K = - Of 3 (fog O OOy ^9f, 

1 1 bg y - -U 1 61. 788, l/(hg a '/ -ZfZ.fZd 967, 

and we find 

T(t)‘IOt^^/OO0, 

7(t) ‘ 1000 • 343 Z4ty'^-f 890 74{y7i)^ 

DCt) = /T/6Z Ufl~ y 9, 
and the life-equation is 


V^-/og / 69 764 



The first, four successive approximations for x give 
X = 10, X = // /4, X. = //.Of 
x^-i/C7 
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The value x ^ /1. 07 substituted in (14) and (16) give us 
and ^26.03 

This value of U(x) substituted for U in the expression for 
DW gives 

V(t}-3dOOJ(/-P*), 

which represents the present value at epoch O of the aggregate 
momentary charges during a period ^ at a rate such as to recover 
the total outlay 11.07 years, the theoretical life-time of the asset. 
The momentary rate is 221,45 per year, the equivalent single 
charge to production at the end of each year is 228.03. 

(Fig. 3) shows the graphs of the two equations. 

f(t)^IOOO- 343.24 ty 3390 
D(t) >3600.3 (/-y^j. 

The abscissa of the common ordinate of the two curves represents 
the life-time of the asset. 

9. It appears from (Fig. 3) that at the point common to 
the two graphs, the graphs have a common tangent as well as a 
common ordinate. To see vyhether or not this is a general prop- 
erty let us trace the changes (n the total outlay and total charge 
to production functions when interest is taken into account. 

In the first place it is evident that the increments of the 
ordinates of both of the graphs in' (Fig, l),jnust be replaced by 
their present values at the chosen epoch, If this epoch is the 
effect in question will be to shorten progressively the ordinates of 
both graphs. The charge to production graph will then be no 
longer a straight line but soirte convex curve, while the total out- 
lay graph will go over into another graph which is less concave 
- than the original graph. But both graphs will continue to rise 
indefinitely as we proceed from left to right because the incre- 
ments of their ordinates, while decreasing indefinitely remain 
positive. 



FIG. 4. 


In (Fig. 4) let and represent respectively the 
total outlay graph and the charge to production graph, interest 
disregarded, T(t) the total outlay graph, interest considered, 
and Dft) the charge to production graph, interest considered, 
through any point E on T{t). The ordinates on 
■sent the present \-alues at epoch O of the momentary charges to 
production during time / at a rate such as to recover the entire 
outlay during the time corresponding to the abscissa of the point 
E , This rate is measured by the initial slope of D(t) , the 
slope of DC t) when t- 0 • 

Let us follow the changes in this slope for the various posi- 
tions of the point E as it moves along T{t) from left to right. 
It is evident that this slope at first decreases, also that it cannot 
keep on decreasing indefinitely, it is therefore plausible that it 
will ultimately increase, reaching a minimum value at the point 
P where the TCt) curve and the Dft) zmvt have a common 
tangent. The abscissa of the point of contact, P, is then the life- 
time of the asset under discussion. 
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10. The foregoing considerations, however plausible, are 
open to objections, because we have reasoned from graphs result- 
ing from the assumption of a special law governing the repair 
• function. Different assumptions might give rise to essentially 
different graphs. We shall, therefore establish the conclusions 
a1x>ve arrived at, by an analytical proof, which is independent of 
any as.suinptions regarding the nature of the outlay function. We 
shall prove the 

Theorem: If the rate U oi & uniform charge to produc- 
tion curve is a minimum, this curve is tangent to the correspond- 
ing total outlay curve, and the abscissa of the point of contact 
represents the life-time of the asset. Conversely, 

If a uniform charge to production curve is tangent to the 
corresponding total outlay curve, U is a minimum, 

To prove this theorem, let y * be the equation of the 
_ , 

total outlay curve, the equation of the uni- 

form charge to production curve, and x the abscissa of a point 
common to the two‘ cufve.s. 

Then T(x) •DM =uj from which 

( 21 ) 

Since bjy hypothesis 4/ is a minimum, it.s derivative with respect 
to X must vanish, that is 

. ( 22 ) 

From (22) and (21) follows 

(23), 

'Jliis shows that at the point common to the two curves their 
slopes are equal, they have therefore a common tangent, and since 

V 
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6^ has a minimum value, must represent the life-time of the 
asset. 

To prove the converse theorem we observe that if the two 
curves have a common tangent at the point zf’= , 

(24) T('x) = 3/JiJ * 6// K 
and 

(25) T(>s)-D’(x)^V’^U 

Substituting the value of U from (24) in (25) we find 
T (>i) = 1/ ^ 

from which 

f(x) TM - (?. 

o 

But by ( 22) this is precisely the condition that C/ is a minimum. 

11. In most cases which arise in practice the analytical 
method of finding the life-equation of an asset fails owing to the 
empirical character of the outlay function. The question suggests 
itself whether a graphic method, similar to that employed in the 
simpler case treated in 7, can be devised, which will yield an ap- 
])roximate solution of the problem. The theorems of the pre- 
ceding article offer the key to such a method. 

I,et us suppose that the total outlay graph has been con- 
structed on a convenient scale, the scale depending on the magni- 
tude of the quantities involved. Every point on. this curve 
determines a definite uniform charge to production curve. We 
seek that particular one of these curves which is tangent to the 
total outlay graph. The abscissa of the point of contact would 



THE SIMULTANEOUS DISTRIBUTION OF MEAN 
AND STANDARD DEVIATION IN 
SMALL SAMPLES 


By ALLEN T. CRAIG 

1. Introduction. If .samples of 77 items are selected at 
random from a normal universe, it is well known that the arith- 
metic mean x and standaid deviation g computed from samples 
ar6 independent in the probability sense and that the simultane- 
ous frequency distribution is 




_ f- r7X^ 

, z a ^ 


1 f, however, the parent population is other tlian the normal type, 
there appears to be little known regarding the form of F ( s), 
In the present paper, we propose to determine the simultaneous 
frequency function of the arithmetic mean and standard deviation 
in samples of small numbers of items selected at random from a, 
rather arbitrary universe. For convenience, we shall classify 
frequency distributions according as the range of the independent 
variable is oa) or aJ,a>(P. We shall further 

assume that the total arqa under the distribution function is unity. 

2. The simultaneous distribution of z and s in samples 
of 77»<?. Let 'ffx),-oi><x<oo be the fiequency '‘unction of 
the variable x Let x, and be two independent!, erved 
values of x , write 1 ' 

xf ^ 
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We seek the function such that Ffx,s)dxd3h, 

to within infinitesimals of higher order, the probability of the 
simultaneous occurrence oix\n(%2i-i:^x}knAs\n{'s,s-f-ds). 
For X and s assigned, x, may have either value S - 3 or 
Z + S and Xjj is uniquely determined by »/2x-X/. 

» ff2-5)ff2x-x,)cix, dx^ 

'riius 

+fCx-^s)f(' Zx-x, )dx, dx^ . 

vSince have 


( 1 ) F(2,s)=> 4f{2s) f( x¥-s). 

If f(x) is defined on the interval oo^, we note, for X 
assigned, that six. Thus (1) is valid for this type of frequency 
function but the surface is limited by the % -axis and the line s*X 

If f(x) is defined on the interval fO, a), we note, for 
X assigned on ( 0, cl/2), that si x ; and, for x assigned on 
(ci/2,a), that si a-x. Accordingly, for this kind of frequency 
function, (1) is valid but the surface is limited by the x ’axis and 
the lines $= x,S=a~2, 

As simple illustrations, let us find the correlation surface for 
the mean and standard deviation of samples of two items drawn 
from distributions of various types. 

Example 1. Let 


~ X < oo 
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the well known result. 

* Example 2. Let 

f(x.}=e'^, 06 x<qo, 


Then 

I * 

F(K,3)^4e'' 


over the, open region of the iBs -plane bounded by the x -axis and 
the line s^x. 

Example 3. Let 

0^x6 a. 

Then 

\ 

over the region of the xS-plane bounded by the isosceles triangle 
with sides S^O, <s = x and s-.a-x . With a uniform dis- 
tributiqii proportional to 4/<2^ over this triangle, it follows in- 
cidentally from very elementary geometry that the marginal totals 
of the distribution of x are given by the known values 

0(X)^-^X^ Oixix, 

jixia, 
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and that the marginal totals for the clistrilnttion of s are given by 

which is the result given by Rider.* 

3. The simultaneous distribution of x> and s in samples of 
■ Consider first a frequency function f(x),~oa<-)i<.oo . 
We have 

Ai, V- V- Xj * J X, 

upon eliminating Xy v/e. have 

2 - 6xXg^~^sK6x^‘0. 

From simple properties of this ellipse, it follows, for assigned % 
and S that x, may be chosen arbitrarily from the interval 
(x~S)/Z,x-t-S ^ ), With X, assigned, X;j must be selected with 
certainty as either 



Finally we must have 


Xj y X -X/ - Xjj , 


> P. R. Rider, On the distribution of ratio of mean to standard deviation 
etc., Biometrika, vol. 21 (1929) pp. 124-141. 
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> nX.+S'JZ 

F{%,s)d%ds-.Z / fU,}f()c^)f(ic )dx,d)c.dx^ 

From 


wc oljtain 

i 

9s 

dXi dx^ dXj « 

\ 

, ^9sdx^dxds//Z 

\ 

here 

Fs[6s^.JZx,-x;^y. 
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If ffk^) is defined on the interval (O, oo)^ ve note, 
for X assigned, that 0 - s ^ Thus, the surface is limited 
by the ^ -axis and the line s=‘X<fZ. Moreover, since 

are non-negative, x, may be selected from the interval 
(x-sjz, x-t-sJzjmXy _as long as 5 \}? /Z ' If 
X s^Xi^jZ. 

then A/y may be selected from the intervals 
{ 3x~\6s^-35o^'k 


and 

Accordingly, ^for thi.s type of frequency function, 



L 

7P 


tcx,) 
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If ffx:) is defined on the interval f O, a), we note: 

for O‘iTi‘^a/3, 0-E.s^x.^-, 

for a/y'ixiZa/y, O^si « 

for a, OEs i ('a-x) JE . 

Thus in this case, the surface is limited by the x;-axis, the lines 
s * ^ yF and 3 = (a -X.)>JE and the hyperbola 

(Lug. 1.). Now »! may' be selected from the interval (2-S-J^, 
x.+s^Z)z^ long as si nWZ and s<{a-2)^IZ . This 
holds for that part of the surface over the region bounded hy OPa. 
h'or that part of the surface over the region bounded by OPP, 
may he selected from the intervals 



Tig. 1 
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It is clear that the ranges of arbitrary selection of x-, for that part 
of the surface over the region bounded hy P\h are 


X- 3 jZ. 


Ca- 




I'inall}', wc find that x, may be selected from the intervals 

-x) j 




for that part of the surface over the region bounded by PC/I/ . 
If we adopt the notation 

we have 

(2.2) F(B.s) - Ws / (^dx,. 

Jx-s JZ 


=18s 


x-s^JZ 

' r 3i6-\3a’^-3B^ 

J' 


= 185 / 
'B- 


3 x 


iaAz-zA 
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?B-<3 J£ -£5s^-.?g^3^ 

' ^ y ■ " 

•4 jE - a y- & 5 Jf/i? ^ 


■ a 


4jE + li5s«-:75S^]^ 
z 




for the parts of the surface over the regions indicated above. 

In order to illustrate the theory, we shall consider a few ex 
amples. 

Example I, Let 

^ ~Z^ 

llv (2), 


xr/- 1 

Example 2. L.et 


f(x)‘‘e ^ , O6x<ot>. 


By (2.1). 


FCx,S)=6j^TTSe^^, 


<P<S < 






arc sirr „ ■, jb -rare sift 


-<xrc sifl 


SZM 

/ 

x+\0s^-^x lA :bM ^ 


Example 3. Let 


fix) 


i. 

a • 


0:Sx<a. 
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By (2.2), 

/rfx,s)» o^er OPa, 


6j^s 


un - ^ K - p - J T — 


\arc sin + arc sin 


£ 

w 




,6jls 


|, over OPU, 


arc sin 


. Z-a-^s^~Xa-2)^ 

trr ■ 


sziz 


iarcsin^-arcstn ^ 


+^j, over PVa, 


A.R^r x-a-[6s^-Jia-£)^]^ 

-^]arc sin Tz]r~-^ 

^arcsin^ +arc 
+ arc sin 

-arc sin i'a-Z) ] 

szjl 


-arc Sin 


x-&hs^-3x^l^ 


,over PVU. 
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I have succeeded in obtaining the marginal total? for S 
from 0 to aJ^/4- hy integrating F{^;s) 'mk'a respect to x 
from the boundary (Fig. i)s^xjE to and ob- 

tain as a result the parabola which is known^ to give the distribu- 
tion of s from S'O to s ~ 

4. The simultaneous distribution of x and sin samples of 
r)» 4 . We shall consider first sanxples of four items drawn from 
a universe characterized by a law of frequency 'f(x), - <w<;e< on, 
Then 

X, 

t 9 y ^ 

The elimination of yields 

~4xXj^-4xx^ •2s^^^X^~0 

It follows from the properties of this ellipsoid that x, may be. 
chosen arbitrarily from the interval Cx-sjf, X*s . For x, 
assigned, the region of arbitrary selection of is determined by 
the properties, of the ellipse and is 

{ 4x~x, -2, [6s^-Z(x,-x)^^ xx-x,+z\efs^-Z{x,-xfY 

I 5 j 

Upon solving for x^ in terms of x, and we have 



while is uniquely determined by « 4x-x, -x^-x^ . 
If we write 


Tslps^-dx^^SiltXi i-SUx^-Z 




*H. L. Rietz [Paper to appear presently in Biometrika], 
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( 2 ) ■4-x-;iii+^0s^-Z(Xr^) ] 

F(k.s}=Xs / ^ 

'^x - -2{6s^-Z(>ir^fY 


^idx^dx^. 


The integration can be carried out in an obvious manner when 
f(»X is the normal frequency function. 

In case f{x} is defined on the interval a>), we note, 
for }i assigned, that s < 5 . Thus the surface is limited by 

the 5 -axis and the line s - x/T Moreover, X/'-raB-y be 
selected from the interval Cx-sJJ, x+sj?) with Xj^ chosen 
as above only as long as 5 < If » J3/3 is 6 Z, 

then may be chosen from either of the two intervals 

(o, ^ 

. ^ and ' 




with X. chosen as above; or may be selected from the interval 
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and 


;Sx^[£S^-^ X i x^£ 3 


with either 


O^xj^ 6 x,-[S£^~Sx^-y»f'^02x,']i ^ 


or , 


Z 

I 


<v/ < 

■ ■«■ 5 


Or we raav have 




with 


4-%-x, -Z\6s^-Z(x,-x)^'^ ^ ^ 


Accordingly, {or this kind o{ {requency function, ^ 

rZ+sJS r A-Z-X,+Z\.0$^-Z(Xr^)^y , , 

^ \r~ ^ 

•^- 5 ^ \P$’^-Z(x,-.xff 

( 3 . 1 ) ^ 

Ossi^ > 

' f 4Z-Z \^3^~ZZ^^r4x -X. 4Z^3 *-ZCk. -JV^I ^ 

3 / J T ^ ^ 

<7 1 — g - --j ' 

X+s^ 4Z-x,*Z\^3*~Z(x,-}i)*\ ^ 

j4x+Z^^-ZZ^ jA-Z-x.-zSps^-zfx, -x)^] ^ 
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Ms s<2j3 
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By similar reasoning, the writer has deternuned F(x, s) 
for rT=i 4- and defined on the interval fO, a). 

The results, however, are quite lengthy and formal and will not 
I)e presented here. 



THE LIMITS OF A MEASURE OF SKEWNESS 

By Harold Hotkllikg and IvKonakd M. SowAroxs, Columbia University 

The measure of skewness 

mean - median 

2 = 

standard deviation 

is sometimes recommended because of its simplicity. Obviously 
neither this nor any other statistic can be of much value until 
something at least is known of its distribution in samples from 
populations of some plausible form. For populations near the 
normal foim the inefficiency of the median as a statistic of location 
suggests that the standard error of § may be considerably greater 
Ilian that of ^Ve know of no investigation of the sampling 

distribution of 2 ■ Api)arently even the range is unknown. The 
object of the present note is to show that 3 necessarily lies be- 
tween -landl. 

I 

The proof consists of three successive transformations of the 
.sample, each increasing q , which nevertheless in the end remain, s 
less than unity. 

1, Without loss of generality let us suppose that the median 
is zero and that the mean x is iwsitive. Taking 

71 77 

77 being the number of observations, which we suppose odd, we 
have 

jS - »/cr. 

If a negative^ observation - a be replaced by -zero, the mean 
is increased hyg/n . In the second of the expressions above for 
(Xf the mean of the squares is diminished by , while on ac- 
count of the change in the mean, a further subtraction is nif^de. 
Thus O' diminishes. Hence a increases if we alter the distribu- 
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tioii liy replacinjf all the negative observations by zero. The median 
remains unchanged at zero. 

2, Let ns further transform this altered distribution by re- 
placing all the positive oliservations by the mean of these positive 
quantities, The general mean is left unchanged liy this transforma- 
tion, but the .standard deviation is diminished. For, denoting by 
g the deviation of a positive dbservation from the general mean, 
E is, for a fixed value of E Z, a minimum when all the a's 
are equal. 

3. Thus the value of s is increased when we replace all the 
negative observations liy the median value 0 and all the positive 
observations b)' a fixed quantity, which we may take as unity. Let 
there be ib<3saiKl k I's m this distribution. Then h-hk^n, 
iloreover, since the median is at 0, The mean is while 

27 tl>0^+k-l^~kV^ = f2k/Tj 
Hence 3=:{k/h)^ 

in this case in which s is a maximum. Since as just remarked, 
h>k, ^ this is always le,ss than unity, approaching unity when the 
observations are divided as nearly as is possilde equally between the 
' two values. 

To go further in a study of the sampling distribution of 3 is, 
[Kissible only on the basis of special assumptions. The same is true 
of the somewhat more familiar but less definite measure of skew- 
ness. 

mean— mode 
standard deviation 

lt< is clear that there is no limit of the range of this last quantity. 
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THE THEORY OF PROBABILITY FROM THE POINT 
OF VIEW OF ADMISSIBLE NUMBERS 

By Artih'R H. Copeland 

I. INTRODUCTION 

The definition of the word probability has never been agreed 
upon. Before we decide on a definition, let us first consider what 
use we hope to make of the theory of probability. It is reasonable 
to demand of this theory that we shall be able to apply it, and that, 
by means of it, we shall be able to make predictions. 

If we say that the probability is .9 that a given event will oc- 
cur under certain circumstances, then are we making some pre- 
diction about the .succe.ss (i.e. occurrence) of the event? Let us 
suppo.se that the circumstances are presented. We may observe 
that the event succeeds or we may observe that it fails. Which- 
ever the case may be, the result of the experiment cannot lie inter- 
preted in terms of the number, .9. This is always the case, ,We 
can never interpret the result of a .single trial of an event in terms 
of the probability of that event. 

Next let us assume that t? trials are made of an event whose 
probability is .9, and that, as a result of this exjierimcnt, r suc- 
cesses and v-r failures are obtained, If r? is large, we should 
expect the ratio, r/v,to be approximately ,9, that is, approximate- 
ly nine-tenths of the trials tobe successful. We shall call the num- 
l)er, r/ 77 , the success ratio. 

We have not even now obtained a satisfactory interpretation 
for the number, .9. We have not specified any limit to the dis- 
crepancy between the numbers, ry77and .9, and we have not 
specified the magnitude of 77 . Thus, if r/rj differs from .9 by a 
small amount, it also differs from .899 by a small amount. Are we 
to be satisfied with the statement that the event in question has a 
multiplicity of probabilities including the numbers, .9 and .899? 

We can make the above statement more exact as follows: 
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Given any positive number, £, we can find a number, r?, such that 
the discrepancy between r/n and .9 is less than£. After the 
number, S , has been chosen, it is at least conceivalile that a sufifi* 
cient number of trials can be made so that r^v will differ from ,9 
by less tlian£. If we make this interpretation of the probability, 
,9, and if we wish to make the statement that .9 is the probability 
of the event, then we are assuming that .9 is the only number that 
; has this proiierty. We are therefore assuming that the ratio, r/??, 
niiproachei .9 as n becomes infinite. 

So far as I know, no one has ever given an alternative concept 
of ))rol)ahility which is capable of being interpreted in terms of the 
result either of a single trial or of a sequence of trials, Unless 
and until such a concept is given, we are compelled to assume that 
|)robability is the limit of the success ratio, if we wish to include 
an eniiierical inter])retation. Since this paper is l^ing presented 
to a group of statisticians, I thiuk it will be safe to assume that 
we arc agreed that probability is concerned with the results of trials 
of events. 

It may be that wc arrived at the probability, .9, by means of 
the following reasoning. There ai'e 9 possible ways in which the 
event can succeed and 1 in which it can fail. All 10 possibilities 
are equally likely and mutually excUnsive. 

When we make a trial of the event, one and only one of the 
possibilities succeeds. The words, equally likely, have no inter- 
pretation m terms of the result of a single trial. The reader will 
have little diffienb} in conlbuiing the analysis of these words in a 
manner similar to dial of the concept of probability. In fact the 
concept, being equally likely, is identical with the concept, having 
the same probability. We shall, therefore, reject the concept of 
equal likelihood as a basis for a definition of probability. 

There is one other objection to this method of finding the 
probability of an event. ’ Namely, there is good reason to believe 
that it never gives the correct result. In making this statement we 
are assuming, of course, that probability is defined as the limit of 
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the success ratio. In order that the 10 possibilities may be equally 
likely, it is necessary that there be perfect symmetry between these 
possibilities. We cannot, therefore, have any mark to distinguish 
the one unfavorable possibility from the other nine favorable 
possibilities. Experiment indicates that such distinguishing marks 
are sufficient to make noticeable differences in the probabilities. 
For example, the dots on the faces of a die cause differences in the 
frequencie-s with which the respective faces turn up. 

In spite of these objections, the above method of finding the 
probability of an event, gives very good approximations in most of 
the cases where it is applied. There is no method which gives expct 
values for probabilities. It seems wise not to reject this method, 
but rather to discard any illusions which we may have concerning 
the exactness of its results. 

We have seen that we must assume the probability of an event 
to be the limit of the success ratio, if we are agreed that probability 
is concerned with the results of trials. Let us express this assump- 
tion in terms of the Cauchy criterion for the existence of a limit. 

Given a positive number. € , there exists a number, N , such 
that I r/ry-r/T^'kfwhenever 77 i/ya.nd ri'=N, where r is 
the number of .successes in v trials and r' is the iiuml>er of suc- 
cesses in p' trials. Physical experiment seems to indicate that 
this condition is .satisfied. Furthermore, if we reject thi.s assump- 
.tioii we deny the possibility of experimental verification of prob- 
abilities. On the other hand, -it can he proved that the number, 
N , can never be known. This, situation is unsatisfactory for a 
mathematical theory. 

To avoid this difficulty we shall construct an imaginary 
idealized universe in much the same manner as is done in the case 
of geometr)’. This universe will contain sequences of successes 
and failures which are formed in accordance with matiiematical, 
laws. These sequences will .satisfy the fundamental assiimijtions 
of prohaliiluy and hence will be infinite. We make the assump- 
tion that the physical universe is an approximation to this idealized 
universe. 
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II. THE ALGEBRA' OF EVENTS 
We shall show how the elements of the theory of probability 
ain be. treated from the point of view which we have described. 
Gor^ider first the following physical example. A coin is flipped 
ten times and the event in question is the occurrence of a head. 
The following is a record of the successes and failures, 

/, 1, 0, 1, 0, O, Q -1, Q O 

where the I’s stand for successes and the O's for failures. The 
ratio, 4 / 10 , of the number of successes to tlie number of trials, 
is obtained by adding all of the ten numbers and dividing by ten. 

If we had made a much larger num})er of trials of the event, we 
should expect that the corresponding success ratio would have 
been much closer to the probability, one-half. 

The above sequence of is and (9s can be interpreted as a 
number written in the binary scale. Let us write 
. UO, WO. 010,0 

This number has the value, •*■1/4 *-0/S '^0/^Z'*0/64*C/lZB 

*-i/Z^6-*0/jilZ*O/0Z4*ZO$/Zy6 . We should not, however, 
think of this number as ending with the tenth digit. In fact we 
could compute as many more of the digits as we desired by con- 
tinuing the experiment. The computation of the values of these 
numbers will not be important for our purjTOses. The above com- 
putation was inserted merely to aid in the understanding of the 
notation which we shall describe. 

We shall now consider -the construction of our idealized uni- 
verse. The sequence of successes and failures of a given imaginary 
event can be represented by a number, 
written in the binary scale, the kth digit, of X being.? or 6? 
according as the evei;t succeeds or fails on the, kih trial. We shall 
denote the success ratio for the first 77 trials of this event \)yp^(x). 

Then 

p 


( 1 ) 



ARTHUR B. COPELAND 


147 


We shall denote the probability of the event, x > , by /O^X) and we 
shall define by means of the equation 

( 2 ) pLxJ^ lim Pr, Cx). 

•h*ao ' " 

We are, of course, assuming that this limit exists. 

Most of the important questions in the theory of probability 
involve relations between different events. We shall therefore 
construct an algebra which is especially adapted to the discussion 
of related events. If Ai.: - are any 

two events, then the event, x and y , will be denoted by a; ■ y . 
We have the equation, ' 

(3) y. ■■■■ 

The first digit of a: • y is i if and only if the first digits of x and 
y are both f That is, the event, .t y, succeeds on the first 
trial if and only if x and y both- succeed on the first trial. 
Similarly for the sepond and third trials etc. The expressions in- 
side the parentheses are understood to be ordinary algebraic prod- 
ucts, The expression, a; -y , is a s)nnbolic product. 

The event, i or y or both, is denoted by We have 

the equation 

t y CO <0 iO ccx , ce) ci) a) ) 

(4) x'^y = (x^ Uy -x y ),(x -x y 

It will be observed that the first digit, of X'^y 

isi if or M if but that 

this digit is 0 if x^'^oy^'^0. Thus the events x '^y, succeeds on the 
first trial if x succeeds on its first trial or y succeeds on its first 

trial or both x and y succeed on their first trials. Similarly 
for the second and third trials etc. 
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We shall use the symbol, x , to denote the event, not x , 
It is easily seen that a/ x is given by the equation, 

Let us denote the event, y \i X , by ycx; * Before attempt* 
ing to give a formula forycx; let us first consider the expres* 
sion, This expression is equal to the numl)er of successes 

of the event, X , in its first m trials. Thus if is the number 
of the trial on which the nth success of % occurs, then^ 

( 6 ) ' 

We can write , 

(7) 3 ’ • • 

Thus we consider those trials of y for which the event, % , oc- 
cufs. In other words we consider a given trial of y if (and only 
if) X occurs on that trial. Hence equation (7) gives us the cor* 
rect expres.sion for the event, y if x . 

[♦The operators, ’ , ^ and 'v are also used in symbolic logic with similar 
interpretations, See Whitehead and Russell, Principia Mathemlica, vol. 1, 
The symbol, c , is an inverted implication sign. The expression, y c x, could 
be read, y is imiilied by x , or, y if . For the benefit of those who are 
familiar with Principia Mathcmalicct, it may be added that the symbols, x, 
y , etc. are propositiondl functions rather than propositions. Each x is as- 
sociated with a sequence of events, ahd each is a propositional function of the 
form, the event will succeed, being a free variable. The probabil- 
ity is a property of the set of propositions rather than of any giyen proposi- 
tion. Thus we should s^k of the probability of a propositional tunction 
rather than of the probability of a proposition.] 


1 
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PROBLEMS 

In problems, 1 to 3, assume that and y have the following 
values 

- JW, 100, 010, OH 101, 000, 10 • ■ 

no, 111, 011,000,100, 010, 11 ■■ • 

1. Compute and 

2. Compute the first 20 digits of (a) a: ■ y , (b) x N/y, (c) /u x; , 
(d)yvvx . 

3. Compute as many digits as possible of y c;>i and c y . 

4. Prove the following identities : 

(a) aj'y-y x (g) Ainj x ~ t 

(b) , x (s/'i)*(xy)'Z (h) A/(x.y)>‘/vx'<'^y 

(c) XV y = y</x (i) ru(xyy)s A/jd- A/y 

fd) xv(yvz)a(xyy)yz (j) x^rvx^l 

fe) »'^y'^z;=(x yjvfxJ'z) {k) (x y)v(x' n)y)rcX 
(f) xy(y>z) ^Cxvyy(xyz) 

5. Prove that fx vy) » (x)^'P„ (y) (x y) 

6. Prove that p (x vy)* p{x) i-pfy) -pfx y) 

7. Prove that p(Ajx) £ l-p(x) 

8. Prove that p[y'njx] = p(y) -pfX"/) 

9. Prove that if x-A/CyvzvwLtJ then x=(xy)v(x'z)v(x w) 

( 
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■ TII. THE COMPUTATION OF PROBABILITIES 

Wc shajl say that two events, x and y , are mutually ex- 
clusive provided » fails whenever y occurs and y fails when- 
ever X occurs. It is easily seen that x and y are mutually 
exclusive if and only if xy^O. It follows from problem (6) 
that 

( 8 ) p(x-^y) ~ p(x)+p(yX i'f x.y-'O. 

If we have three events, x , y , and z , which are mutually ex- 
clusive, then x y ’y' z ^ :i x = O . Hence 

p(xvy'y2)a p(X''y) -i- p(Z ) = p(x,)i- p(y) + ^(z). 

We liave the following theorem. 

Theorem 1. If the events, are mutually 

exclusive then 

Supi)ose we have a .set of events, x, , Xg, • • , such that 

at least one of the events must occur, 'thtn 
Suppose further that these events are mutually exclusive and that 
their probabilities are equal. Then pfx,)-fp(xj* • • ■= 1 

and therefore »••••= = ^/t?. This principle is 

very useful in the comjmtation of probabilities. , 

Example 1. From a pack of 52 cards 1 card is drawn. What 
is the proljahility that this card is the ace of spades? It is reason- 
able to a.ssume that the probability of drawing any one of the 52 
cards, is the same a.s that of drawing any other card. Thus we 
have 52 cvent.s which have the same probabilities. Moreover the.se 
events are mutually exclusive and it is a certainty that at least 
one of the events will occur. Hence the desired probability is iM 
Exhmplc 2. From a pack of 52 cards, 13 cards are drawn. WhaL 
i.s the prohaliility that these cards are all spades? We assume that 
any combination of 13 cards has the same probability as any other 
conihination of 1.1 cards. Since there are such combina- 

li-ms, the prolwbihty is 1 , l/6}% 013 , } 79,600. 

We shall now compute the probability of the event, y c it . , 



ARTHUR H. COPELAND 


151 


We h;ive the equations, 

' D -77 

mr, 




where r??^ • (ic) = rt. 

If we allow n to liecome infinite we {jet 


flO) Pfyc)c)^ p('x y)/pfx). 

Multiplying; l)oth sides of equation (10) by p(x)'Nt get 

(11) pfx)' p{yc.x)’‘ pCy-x). 

Rx:uni)lc 3. A pack of 52 cards is divided into 4 piles of 13 
cards each. One pile contains just 1 heart and the other 3 piles 
contain 4 hearts each, A jjite is selected at random and a card 
is diawn from this pile. What is the probability that the pile 
selected will be the one containing just the one heart and that the 
card selected from this pile will lx; the heart? Let y represent 
the drawing of a heart and X represent the drawing 6f the pile 
containing just one heart. ^\sxn p(x)-lM'xaA pyycx)^^!/!}. 
Hence pfy x)*p{x)-p(ycx):l/^Z This is the desired prolxibility. 

We shall say that an event, y , is independent on an event. 
X . provided the probability that y will occur is the same 
whether x occurs or not. If we express this condition for inde- 
pendence in terms of our symbols we will get 

( 12 ) pyycx)^ pCycr^x), 

Hence 

p(y-^) _ p{y-fvx) _ p(y)~p(y-x) 

(13) pfx) ~ pf/\jx,) ~ 1-pfx) 


Therefore 

(14) P(x y} a p(3i)-p(y) 

It is a simple matter to reverse our steps and start with equa- 
tion (14) and obtain equation (12). Moreover, from the sym- 
metry of equation (14) it is easily seen that if y is independent 
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of X then x is independent of y . We have now proved the 
followiiif,' theorem. 

'I'heorem 2. A neces.sary and sufficient condition that two 
events. X and y , be independent, is that p(X-y)=p(x)-p(y), 
Example 4. A coin and a die are thrown together. What is the 
probability that the coin will turn up a head and the die will turn 
up a 3 ? I,et x represent the occurrence of a head and y repre- 
sent the occurrence of a 3. Then and p(y) ”1/6 . Since 

, the events are independent it follows that p(x y)~J/}Z 

It .should l)e observed that equation (11) is always true but 
tlwt equation (14) can only u.sed when the two events are in- 
dei)endent. The term, contingent, is u.sed to ap])ly to events which’ 
are not independent. If X and y are two contingent events 
we mu.st use ecination (11) to conqHite pfxy). 

In order that three events, .Ji . y , z , may be independent, 
it is necessary and sufficient that p(x-y)<”pfx)p(y), 

p(y)p(z), o (2 3C)r^p(2)p(x),p(x-y'z) 

= pMptyi)” p(y)p(z x)= p(z)p/x-y), 

Thi.s delinition is easily generalized to the case of p events. 

, It is generally assumed that the trials of an event are inde- 
pendent. What does this assumption mean? Supiwse, for ex- 
ample, that we wish to say that the first trial of an event is inde- 
pendent of the second. The first trial constitutes an event, X, , 
and the second trial constitute.s an event, . hut we have only 

defined one trial of x, and one trial of . Indeiiendence is 

defined in terms of probabilities, and probabilities can be given 
meaning only in terms of sequences of trials. 

We call get around the difficulty in the following manner. 
Suppose we wish to consider the independence of 77 trials of an 
event, x . \Vc will consider r? events, , Zjj. jSj, • < .The 
first trial of z, will be the first trial of x . the first trial of 
will be the second trial of x , tlie first trial of will be the third 
trial of z , etc. The 2nd trial of z, will be the (p+t)^ trial of 
z , the 2n(l trial of will be the (TH-Z)nd\x'\i\.pi z , etc. In 
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jveneral, the digits of the number, , are selected from the digits 
of the numlwr, x . The digits selected arc, the rth , the (r-fr))ih , 
X\\^(riZn')th,(r-^)rf)ih- etc. 'I'hat is 
(IS) . 


We can now speak of the indeiiendence. of the numlicrs, 

. . . if 77 . 

It will be observed that 





and hence we can write 


( 17 ) 


xc 


l-Z'^ 


W'e sliall ablireviatc this notation still further and write 


(18) { r/yi)^:^ xc. — . 

l-Z'^ 

It is natural to assume that p[(r/n)x]^ pCxJior every pair of 
nuinhers. r and n , such that 0<rin. If we assume this, and 
if we assume .that the numbers, (l/v)xjZln)x,0/v)3i . . .(j7/?^x, 
are independent, then » must .satisfy the following equations. 


( 1 '-^) pi(r,/p)x ■(ri/v)x ■ • •(rj(/n)x\=.{p(x)\^ 


for ever) ri and for every set of integer.s. r, . , Tj. , . . , 

such that O < r, < ri< ■ < ■ 4^ ^ rt. 

' .\ny number, x , which satisfies eciuations (19) is called an 
admissible number. It can 1)e proved that there exist admissible 
numbers.* It is clear that an admissible number, x , characterises 
the liehavior which we should expect from a sequence of trials of an 
event with probability, 


l*See the author's article, Admissible numbers in the theon of Probabilily, 
American Journal of Mathematics, Vol. L, No, 4, Oct. 1929]. 
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Example 5. An event, AJ , Iia'i tlie probability, pM ■ What is the 
probability of obtaining jirecisely two successes in three trials of 
the event? It is rer|uired to find 

-[0/3h (IWx-rvf^/^h-]]. 

Each of the square brackets contains three inde]iendent numbers. 
Thus for each square bracket we have the probability, pfrun). 
The , square brackets themselves constitute three mutually exclusive 
events. Hence the desired probability is ^[pM] p('yx). 

Let us find the probability of r successes and n-r failures 
in n trials of an event Let ppc3=pc\n<\ p('vio)=q . ^'he prolm- 
bilit\ that a given set of r trials will ail be successful, is p '^ . and 
the probability that the remaining n-r trials will all be failures, is 
^n-r ^ successful trials can be chosen in ways. 

vSince all of these ways arc mutually exclusive, the desired proba- 
hility p'^q . 

Consider the following problem. Let ai , , be a 

set of mutually exclusive cvent.s whose i)iobal)ilities are known. 
We .shall call these cvent.s causes. lyCt y be an event w'hich can 
occur only as a result of one of the, causes. The irrobahilities of 
y if a?/ , y if . etc. are also known .\n experiment is per- 
formed and it is observed that y otcur.s. What is the proliability 
that this occurrence is a ic,sult of kih cause 'I'he answer to this 
question is given by the following theorem. 

Theorem .3. Jf x, . is a set of mutually ex- 
clusive events, and if y is such that O ^ 

then 

P/'x^cy)- -s — ~ — :: : • 

^ £ P(Xi) p(yc.xP 

tPi 

, Since y ---''x^^O W. follows that 

y= {y\x,)'^Cy x^)''" • 

Hence P(y) - p(yxp ■4- pCy pZy x„) . 
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Therefore 

p(y) = p(x,) pfycz^)i-p('x^)pCycx^)-i- ■ ■ ■ f p(x„)p{yc3C^. 

T(3 complete the proof of the theorem it is only necessary to sub- 
stitute this value of p(y) in the equation, p(x ip:y)= pfxpy)/p()iX 

and then suh.stitute p{x^) pfyex/^) Uye p^x^^ y). 

Theorem 3 is known as Hayes’ principle The proljahilities, 
p(X,), p{x^)y . . p/x„). are called a I'rioii probabilities, 

whereas the iirobabilitics, pfx,cy), p{x^cy), . . . pfx^cy), are 
called a patllori. 

Kxample 6. There are four urns. .U, , (J^, U,. The urn, , 
contains three black balls, I// contains one white ball and two bl.ack 
balls, L/^ contains two white and one black, and contains three 
white balls. An urn is selected at random and a ball is drawn from 
it and found to l>c white, ^^'hat is the probability that the ball 
came from 4-^ ’ Let • X, , Xj represent respectively the 
drawing of ,U, U^. and let y represent the drawing of a 

white ball from the urn selected. Then 

p(xp = pfx,) pfxp = p(Xf) » 1/4, 


ami p(ycxp -- 0 . p(yc x , ) « //^. pfyex^ )=,2/},pfycx/. 


Hence 


pf'x.cy)^ 



Kxample 7 'iTvo people, A and 15. make the same statement iii- 
depcndenlly. Let this event he denoted by y . Let X denote the 
event that the statement is true. Then y can lie the result of two 
causes, x and Xp'^^x. It is giren that the probahilities of A 
and J> speaking the truth, are resjwtiveh a and b. What is the 
a pixliori [inihahility that the .statement is true? We know that 


p(ycx,) = ah and p)fycxJ=((-^)(^~^^ ■ Hence 


pfxcy) = 


a.b p (x) ^ 

a 't ■ p(x)4-(J-QXl-b) -plnjx) 
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m 

It iiiifilit l>f fiddl'd by wav of warning ihiit it is easy to state a 
problem of this kind, which is without meaninfj. 

Uet us coii.sider the problem of finding; the probability that an 
event, X- • will proceed an event, y , a tie being excluded. We 
have the four po.ssible situations. ji-y, 

The tirst .situation represents the tie. and this situation we have ex- 
cluded. In the second situation neither x nor y succeeds, and 
this situation should also be o.scluded. The event,*; , will proceed 
y provided X succeeds and y fails if either of the last two 
.situations occurs, llcnco the desired i>iobability is 

I ^ JJ p(x-n>y)p p{y 

When X and y are niiUually eKclusivc this last ex]n‘e.s.si(>ii takes 
the following torni: 

pM 

pCx) p p(y) 

IV. CONCKUSION 

'rite aliovc examples illustrate how the theory of prohahility 
can lie developed in terms of our idealized universe, lly this 
method we can construct a consistent mathematical theory, and one 
Which admits the possibility of experimental verification. 



RELATIVE RESIDUALS CONSIDERED AS 
WEIGHTED SIMPLE RESIDUALS IN THE 
APPLICATION OF THE METHOD OF 
LEAST SQUARES 

Walter A. Hundricks. 

/imior Biologist, Bureau of Amnwl Industry 
United States Department of Agriculture 

In a recent paper the writer’ discussed some considerations in- 
volved in fitting a curve, by the method of least squares, to data 
in which the magnitude of the errors of measurement was affected 
by the size of the dependent variable. For the special case in 
which the percentage errors of measurement were distributed nor- 
mally, it was shown that the most probable values of the depend- 
ent variable could be calculated by minimizing the sum of the 

Y 

squares of residuals of the type, > with respect to 

V , V being the arithmetic mean of the ratios of the observed 
values of the dependent variable to the corresponding calculated 
values and equal to unity at that minimum. 

The concept of a relative residual has a certain value to the 
investigator as an aid in visualizing the nature of such a set of 
data. However, it is possible to use a different method of analy- 
sis, based on the theory of weighting, which will yield exactly the 
same results when applied to such, a set of data and in addition 
possesses the advantage of being applicable to more general prob- 
lems in which the relation of the, errors of measurement to the 
values of the dependent variable is more complex. 

Standard texts on the method of least squares such as that 
by Merriman,“ show that if the probability of the occurrence of 
an error of a given magnitude varies for measurements of suc- 
cessive values of' the dependent variable, it is necessary to weight 
the observation equations when fitting the curve. If the errors of 

^Hendricks, Walter A, 1931. The use of the relative ffesidual in the appli-’ 
cation of the method of least squares. Amials of MafHematkol Sialisiks, 
2 (4) ; 458-478. 

»Merriman, Mansfield, 1907. The method of least squares. 230 p., illus. 
John Wiley & Sons,, New York. 
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measurement, made in obtaining one observed value of each of 
several successive values of a dependent variable, f(X) , are in- 
fluenced by the majinitude of, f^X) , the probability of the occur- 
rence of enors of the magnitudes, , 

respectively, is given by the following equations: 

P„-.k„e'^r,^n ( 1 ) 

The probability of the occurrence of the given system of “’•ors 
is given by the product : 

p'^k'e ^ ^ ( 2 ) 

in which P' - Pi-P^. ^and k'^ k^.k^ k^ •■k^, 


ff the exponent of e in equation (2) is dl\ided l)y a con- 
stant measure of precision, , the equation may be written in 
the form : , , 

P'^ ^ ( 3 ) 


in which ^ p,h^ , etc., and p, , p^, are the 

weights of the corresponding errors. P ' will have its maximum 
value when the value of the expression, 

IS a minimum. 

Applying the above principles to curve fitting and substituting 
a residual, V , for every error, x , to distinguish the residuals 
from the true errors, it is evident that the constants of a fitted 
c(iuation must be determined in such a manner that the value of 



the expression, p^v,^i ^ , is a 

minimum.* 

If the equation to be fitted is of the type used in the writer’s 
previous study (loc. cit.), viz.: 


Y-AX‘ 


■ . (4) 


this condition is obviously satisfied by the solution of the follow- 
ing equation : 


d^, dv^ dv 


Let represent any observed value of the dependent vari- 
z 

able and let represent the corresponding most probable 

value. Then it is evident that: 

Vi^AKt-Y^ ( 6 ) 

I 

and 


dvi 

dA 


. . (7) 


All that remains is to find the weight, pi . 

Equations (2) and ‘(3) show that the weights of the errors 
are projwrtional to the respective measures of precision, h,^ , , 

hf, — . It follows from the well-known relation between 
the measures of precision and variance that any measure of pre- 
cision, , is equal to which is the standard error 

of the observed value, y , of the dependent variable. There- 
for, any weight, , is given by the equation ; 



( 8 ) 


*The above development follows that given by Merriman with a slight 
cliangc ill notation. 
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in vhich Z?"® the coii'^tant measure of precision in equation (3), 


If a set of data were obtained by making one measurement 
of each of several successive values of the dependent variable, 
AX^ . Olid the resulting percentage errors of measurement were 
distributed nprmally, it follows that the coefficient of variation of 
a number of replicate measurements made at any value of AX^ 
would be ecjual to that obtained' for every other value of AX'^. 
In other words, the standard error of every measurement would 
he direcllv proportional to the value of AX'^ measured. 

Since, by equation (8), the weight of, any error of measure- 
ment is in\ersely proportional to the squ.'ire of its standard error, 
it follows from the aliovc discussion that this weight must also 
be luveiacly proportional to the square of the value oi AX^ 
measured. Combining all factors of liroportionality into a com- 
posite constant. C , the relation between any weight, p^ , and 
tlic corresponding value of the dependent variable, AX.^ ^ , may 
he expressed by the equation : 


(9) 


If the substitutions suggested by equations (6)', (7), and (9) 
are made in equation (5), this ecpiation may be wrilten: 

C(AXl-Y^)Xl , C{AX^-Y^)X", 




-o 


( 10 ) 


A^X^ 

Since the constant, C , is common to every term in equation 
(10), it may be removed hy division. The e<iuatioii may then be 
reduced to the form: 

T' (AX^~Y)X^ 


A^.K‘* 


-0 


( 11 ) 
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If there aie n observation equations, equation (11) may he 
wiilten in the form, 


hA rX 

A ^ 


( 12 ) 


from wliieli the most probable value of A may he leailily calcu-' 
latcd, ' 

nA-X-A - () 


X" 

or 


^ „ -Z r X 
A „ A 


(13) 


ICriuation (13) is identical with the equation olitained by mini-, 
mi/ing residuals of the type, ■ reported in the writer's 

earlier study (loc. cit.). The development given in the present 
paper is perhaii.s the better from the purely mathematical point of 
view since it involves nothing more than a .sy.steniatic weighting 
of the ohscivation equations. It can he ap])lied to any problem 
in curve fitting if the standaid error of each ob.served value of 
the dependent variable is known or can be deduced from a priori 
considerations .For example, it often happens that the means of 
replicated measurement.^, rajher than the individual mcasurenients 
themselves, are irsed in litting the curve. In such cases the stand- 
ard error of each mean may easily he calculated. The leciprocals 
of the squartss of these standard errors will then he thc lequired 
weights of the observation equations 

However, if the standard errors are projrortional to the values 
of the dependent variable, it may he desirable to retain the con- 
cept of a relative residual. .The siguificarice of a percentage error 
of measurement probably can l)e appfeoated by niany invc.sti- 
gators in various lield.s of Research, particularly those whose con-, 
tact with mathematu’s is iiioit or lc,ss incidental, to whom a system 
of jveighting would seem somewhat artilicial and .arhitiaiy. 
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In either event, the necessar)' computations are identical. The 
precise procedure described in the present paper, like that devel- 
oped in the writer’s previous study (loc, cit.), cannot be applied 
when the equation which is to be fitted contains more than one 
undetermined constant. However, in actual practice it is usually 
sufficiently accurate to substitute the square of the observed value 
of the dependent variable for that of the corresponding most 
probable value in equation (9). If this is done, the method can 
he applied to any equation which can be fitted by the method of 
least squares. Using this substitution is equivalent to expressing 
the errors of measurement as fractions of the observed values of 
the dependent vaual)le when the standard error of each measure- 
ment is proportional to the quantity measured. 
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POPULATION PARAMETERS 


T. Introduction. 

It frequently happen.s that all of the individuals of a sample of 
statistical data 'from a multivariate population are not 'observed or 
classified with respect to all of the variates. If a sample be repre- 
sented in matrix form by allowing the row,s to represent the in- 
dividuals and the columns to represent the variates, then the matrix' 
of the tyixj of sample with which we are concerned is incomplete 
in that some of the elements are not present. As an example of 
a fragmentary sample of this nature, we may consider a series of 
mea.surement.s taken from certain parts of a group 6i human 
skeletons from some archeological find, in which some of the parts 
under con.sideration are missing from some of the skeletons. 
Again, we find such a class of samples in the social sciences and 
government statistics arising from incompletely answered question- 
naires. 

In dealing with fragmentary samples, it is important to have 
at hand techniques which will enable the investigator to extract 
as much infotmation as iwssible from the data. This is especially 
true if the data are unique or expensive. An important problem 
in this Connection is that of estimating the population parameters 
from the sample. 

In this paper it is the ixrrpose of the author to investigate in- 
complete samples from a normal bivariate population. To be more 
specific, samples are considered from®a normal bivariate popula- 
tion of X and y , in which s of the items are observed with 
respect to x> and y , rn with respect to x only and ri with 
respect to y only. In the first part of the paper we shall con- 
sider various sets of simultaneous maximum likelihood estimates 
of the joopiilation parameters and the limiting forms of their 
sampling variances , and 'covariances in large samples. In the sec- 
ond part we shall consider other less efficient, but simpler systems 
of estimates. 



11. vSinuiltancoU', ICstinwtiim In- tbe i^rotliocl of Maximum 
T.ikclihoorl. 

Let a cample oj of rV individuals be drawn from the popula- 
tion of the two variates and y whose di.strihution i.s given bv 

1 

ZTicx/SyVI-r^ 

where a and b are the means,' and the .standard devia- 
tions andr the corrdaiioii of x and y in the iKJimlalion. Let 
he the set of 5 individuals of this sample f)ljaer,\cd with re- 
spect to andy , the .set of ir? items ohseived with respect 
to z only and lOy the remaining n item.s observed with ve.spect 
to y only. To avoid trivial results we shall asMinic that s is not 
zero. Furthermore, we shall let t and T) be the varianees'and 
S’ the convariance*, le and y the means of .r: and y in<4y 
The variance and mean of z m uJ^ will be denoted by u and 
Z, respectively, and similarly, the variance and mean of y in 
Qy will he V and y .The joint distribution of x , p , 2^ . 

, , u . v^ . f . J 7 iind S can he written from several 

well known independent distiihulions a.s 




(X-aJ 




(y- i>) - 

/r^ 


Zr(x-aXv-t>} 






-I y 


X V J _ X 


wheie 


•Zr 


'X 

ri’i 


U^v 


K- 


„ „ m+n-tZs 


Wli r('°f‘jr( ■^}rl VyiW) 


'll, fill tvjo sets of viliiales, • • •f/r?' ^st/Fu, • ‘ ' 

W<.‘ let anti 

then ^ and are the means, y, and the variances ■' 

covariance of the two sets off's. , , 
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The likelihood of to when tx) is si>eclfied in terms of the 
forcgoinc; statistics is _c;iven by (1). We shall use this expression 
i)f the likelihood to obtain approximations for the maximum like- 
dihood estimates of the population parameters r , a 

and b . Followinp: Fisher'-, we .shall take the loeiai ithiii of (1) 
and denote it hyL. For eoinenience, we shall, once for all, set 
np the following set of first derivatives. 






bb <ry 


(y-b) /S(y,-b nZ-a) 




[2)(eJ 


bL _ 3 
'xjL 




-(!*■ 


0)i( I- r^}\ 


bL 5 

(■64) — * — 


, ^ . /f ^-5 

‘ ' n X? y' ^ J 


I / fj r ? \ fiy 


(e,) 


br br^ 


r+ 




J/ at 


.££L) 


where 


= t->-(Z'a)^ , f) =.ryi-(y~b)’' , a~u+(ic,-a)^ 
V - V b)^ , l*(l-aly-b), 


‘ hi order to' consider the limiting form of the .sampling vari- 
ances and covariances of the maximum likelihood estimates, we 
shall need the matrix of iiiatheniatical cxixictatiuiis of the second 
devil atiies of L with respect to the five population parameters. 
Tills matrix of expected values turns out to be, 

-R. A. Fibhcr, The Mathematical foumlations of thc(]ictical stati-sfe. 
Philosophical Transactions of 'hi! Roval Society of Tondoii, Vol. 222 (1922), 

in>. .509-.16H. 




where Z) is identical with a , ^ , and r as / takes the 

values 1, 2, . . 5 respectively. Again, << and ^ denote the 
ratios ^ which we shall consider constant as s -«■ «> . 

The maximum likelihood estimates of any number of the five 
population parameters for given values of the remaining paramet- 
ers are to be found by setting the corresponding first derivatives 
in (2) equal to zero and solving the resulting equations simultanfe- 
ously. In most of the cases of practical interest, the solutions must 
lie reached by approximation. In this paper we shall consider the 
following cases : 

1. Estimation of a and b for given estimates of 
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and r . 

2. Estimation of and for given estimates oi a , b 
and r , 

3, Estimation of cr , and r for given values of a and 

X, y 

b , 

Before proceeding with the maximum likelihood estimates of 
these parameters we shall consider the notion of the efficiency of 
a set of statistics designed to estimate a set of population para- 
meters, 

1. Joint efficiency of a set of estimates. 

In order to attach an economic value to a sample and its in- 
dividuals, Fisher" has defined the reciprocal of the variance of a 
maximum likelihood statistic w of a sample from a univariate 
population as the amount of information contained in the sample 
relative to the population value of w . For large samples, in which 
the distribution of w tends to normality, this quantity is a con^i 
stant multiple of the number of items in the sample. The amount 
of information contributed by each , member of the sample can be 
found by dividing by the number in the sample. 

We can extend the idea of amount of information relative to 

( ( 

,a system of population parameters contaiined in a sample by con- 
sidering the reciprocal of the determinant of the limiting values, in 
large samples, of the variances and covariances of the maximum 
likelihood estimates of this system of jxirameters. This extended 
definition also holds for systems of parameters estimated from 
multivariate populations. 

The reason for adopting this determinant as the extension of 
the idea of the amount of information relative to the set of para- 
meters under consideration, is apparent when we note that the 
square root of its reciprocal enters as a multiplier in the asymptotic 
normal distribution of the maximum likelihood estimates of the 
parameters in the same way that the square root of the reciprocal 

“R. A. Fisher, Statistical methods for research workers, third edition, 
Oliver and Boyd (1930) pp. 266-270. 
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of the sampling variance of the maximum likelihood estimate of 
a single parameter enters as a multiplier in its asymptotic normal 

distribution. 

Fisher‘ has shown that for large samples, the maximum like- 
lihood estimate of a population parameter is distributed with small- 
er variance than any other statistic designed to estimate the same 
parameter. In the case of a set of parameters, the determinant of 
the matrix of limiting values, in large samples, of the variances 
and covariances of the maximum likelihood estimates of the para- 
meters is smaller than that for any other estimates of the same set 
of parameters. 

To prove this, let us consider a set {Pi] . 
of population parameters, and let the set ^ j be their maximum 
likelihood estimates, whose sampling distribution for large samples 

where H- I hij\ , where zni L is the 

logarithm of the likelihood of the sample, n is the reciprocal of 
the matrix of variances and covariances and covariances of thef's. 
Let the set | be any set of estimates of [ p, | in which at least 
one a is not a maximum likelihood estimate, and let the asymp- 
totic normal distribution of the u’s be. 

(zn)^ 

where H- 1 | , which is the reciprocal of the determinant of 

the matrix, of variances and covariances of [u,] , Now, olii 
problem is equivalent to that of showing that H>K, Suppose 
there is at least one set of estimates of [ P/ j containing at least 

‘R. A. Fisher, The theory of statistical estimation, Proceedings of the 
Cambridge Philosophical Society, Vol. 22 (192S) pp. 70O-72S. 
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one estimate which is not a maximum likelihood value, such that 
the reciprocal of the determinant of its variances and covariances 
is greater than or equal to H. Let this set be { | . Then, by 
hypothesis, K-H. ^ 

Let T be any linear transformation pure 

rotation of the axes representing Ui-p,- ( • n) about the point 

PrPi. ' Pr, , ^ii (^rPi y^J-Pj) 

to a sum of squares, . Here we have 

a/^ and pj) , where is, the cof actor of a/^ 

in lo^yl . Then is the reciprocal of the variance of the 
variable u^^tb;.U; about its mean value and 

Tf k.-K I since the determinant a,-, of 7” is unity. But 5 

■ts/ ^ \ ij i 

is not the maximum likelihood estimate of , since at least one 
of the u’s is not a maximum likelihood value. As a matter of fact 

n 

L b;. t. =t, , say, is the maximum likelihood value of 5, , for 

dL ” dL dPi n dL 

Z — TTT =23 a.ri — 
dp^ t-i dpi bp^ 1=1 ^'^bpi 

n 

vanishes only for p^ = , that is, for /d^«23 b^^t- ; (provided 

we assume that 0(i=lZ, • • ‘ 75 !) has the unique solution p. = ). 

It follows from Fisher’s" proof for the’case of one variable, 
that the reciprocal of the variance of is greater than k^. 
Hence, U p. > U k,. We note however, that the maximum like- 
lihood estimates { /^J are not independent, for their distribu- 
tion is 



— // 

where , which is not necessarily zero for 4 f /d . 

The effect of this non-independence is to introduce a term ^ as a 
multiplier of , where is the determinant of correlations 
among the ?'5, and is less than unity. Hence, 77 y n a y 

O^s/ 7Z flt*/ 


‘R, A. Fisher, loc. dt 
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It is_well known from the theory of quadratic forms that 
the matrix \\^^^\\ is found as the product || 5,^ || • || h^J\ ■\a • || , 
where ||5/^|| is ||a^^ || with its rows and columns interchanged. 
Since the determinant \c2^y\ is unity, it is dear that I h I 
which IS equal to ^ , has the value | \ which is H by 

definition. Therefore we have H>K, which contradicts the hypo- 
thesis that K^H. Hence, we must have K< H . 

Thus, the proposition is proved that the reciprocal of the de- 
terminant of variances and covariances of the maximum likelihood 
estimates {t, j is smaller than that of any other set of estimates, 
all of which are not likelihood values. 

We are now provided with a means of measuring the joint 
efficiency of a set of estimates in utilizing information in the sample 
relevant to the iX)pulation parameters estimated by the set.. We 
shall take as a measure of this efficiency the ratio of the reciprocal 
of the determinant of its variances and covariances to that of the 
set of maximum likelihood estimates of the same parameters. 
This quantity is less than unity, as we have just proved. -The ef- 
ficiency of is therefore 

A- 

Sff “ ^ . 


2. Simultaneous estimation of cz and b . 

We shall suppose that satisfactory estimates have been ob- 
tained for c^, cfy and r . If they are to be taken from the 
sample u) , we can take a/ as the variance of the in and 
as that of the y's in co^^y and and r from ui^y . 
In any case our problem is to find the optimum values of a and 
b for given values of cr^ , cfy and r. These values of * 
and b are found as the solution of the equations obtained by set- 
ting (e,) and (e^) in (2) equal to zero. Accordingly, we find. 
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<rjl-r^) 


(y,-v) 



'['he matrix of the variances and covariances of d and d 
III samples is olitrtiiied by taking the reciprocal form of the two 
way piinciiral minor in the upper left corner of the matrix (2)" 
Titus we find, 




aD 

sD 

ra^c^ 

cr/ [UK(J-r^)'] 

sZ> 

sD 


* where r 


We note from (6) that the va'riance of a is 
a^[l*l3 fj-r^)] ^ 

and (i similar expression holds for Jg . The correljttion coeffi- 
cient of d and p is 

^ ^ r ^ 


_ "See Karl Pearson, On the influence of natural selection on the vari- 
ability and correlation of organs. Philosophical Transactions of the ^oytil 
Society of hondon, series A, vol. 200 (1900), pp, 3-10. Here Pdarspn gives 
a method of obtaining the variances and co-vananccs of the variates in a 
iiormal multivariate proUibility function. 
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I^roni the definitions in section 1, we find that the amount of 
information in oj relative to a and t> is the reciprocal of the 
determinant of (5). That is, 


(7)' 


A(m,77,s)‘ 


S^*s(m+r])+ rrnid-r^) 


From (7) we can find the relative amounts of information 
contributed by members of and u;y by means of dif- 

ferences. For given values of v and s , we have as the informa- 
tion contributed to to by an milsi individual of , 


(7a) 


A^Jrrn-l)^A(md, n,s)-A( m, n, ts)= 


s+nfj-rd 
a^a^(J-r^) ' 


which is independent of 777. A similar expression holds for the 
Tii-Isi member of cOy. 

For the s-*'Is{ member of , for given values of m and 
77 , we get 


(7b) 



(sd)= 




It is clear that an additional member to to^^ is more infor- 
. . - . ^ 

tnative than one to each of and tOy by an amount ^ 4 ^, 

or, considering the ratio rather than the difference, we have 


A(md, 77 /i, 9 )- A I'm, 77, s) 


r^f m+ri4-l) 
Zs* m-fn-tl) 


We find that the amount of information introduced by and . 
t^y is A(nm5)-A\{()Qs), which is its 


ratio to the total information (7) is 


4 s(m+ n)* mn ( 1 - r^) 


1 - 
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3. Simultaneous estimation o! and <Dy. 

If we suppose that r is given as well as a and /b , we can 
find the optimum value of and by solving the equations 
obtained by setting (e^) and (e^) in (2) equal to zero. Ac- 
cordingly, we get, 

/o) ^EfEF-'GV _ . 

^ ^ ZEPfl -/3h F4- BFG ^CU<XF/5) + 6'*('’(^ -fi ) ^ 



Z EF(F0) -GV/3-^)-^ V4EFG ^(J^-^)(U/3) 


where 

r ^ 




T-r‘ 


and G~ 


r S 

i-r^ • 


The variances and covariances of b and are given by 

X y 

the reciprocal form of the matrix obtained by striking out the last 
row and column from the third order principal minor in the lower 
right comer of (3). For the variance of , we find, 

A similar expression exists for . The amount of information 
yielded by cj relative to and cy under these’ conditions is 


(9) 


A'( n!^n,s)^ 


('^77 3 S.1- £ r [ 5 m I* S r? ■/ £n7j7^ 


From (9), we find the differences corresponding to (7,a,b,c) to be 
(9a) /I' (m,t) *l=iI!>-?r‘(=*Zr,) 


(9b) 


A,j) 
xy 


a* (1-r^) 

0s + 4{ rTnrn-i)-Zr^ ( m-/-n ) 
(1-r^) 


a' (■m-tl,nM,s)-Afm,r7,s) f^ym-fP^Zs+2) 
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4. Simultaneous estimation of cr cr and r ■ 

fC y 

Let us suppose a and b to be satisfactorily estimated. For 
large samples, a and b can be estimated from the sets of a;'s 
and y's obtained by pooling ijj and to . Whatever e.sti- 

x^y A* y 

mates we may choose for a and b , our problem is to solve the 
equations obtained by setting . (e^) and (e^) in (2) 
equal to zero, for , Oy and r . 

If we denote the quantities in the brackets of (e^) , (e^) 
and (Ej) by / , ^ and h respectively, then we are to solve 
the equations f =g h--0 iov and r . The method of 

elimination seems to be of little value in solving these equations. 
Then we shall use the extended form of Newton's approximation 
method and find an approximate solution. Considering nothing 
higher than the first order terms of Taylor’s expansion Qif,q 
and /7 we have (letting a , o^=y, ), 

= 0 

( 10 ) • g, + ('>i-x)g^+(y-y^)g^^*(z-T£,)g^ ,0 

h,+h- ajhy 4.(y-y^}hy+(it-z,) h^*0 

where ^,'f(x,,y,, n,^), and so on, 

We shall take for the initial point, , y = and 

H,= , which are, for rn^ri^O and a=x, b^y , the maxi- 

mum likelihood estimates of ov , or, and r from a) . 

X y 

Solving equations' (10) fora:, yand z by Cramer’s rule we 
find ‘for the first approximation beyond the initial values, 
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/flv <u 1 , s „ 

where D- 1 / 7 -/? 7 ana P=_J^ ■ By «sing the 

point whose coordinates are given by (11) in place of the initial 
point in (10), we find a second approximation point, and continu- 
ing the process we get a sequence of points, Such a sequence 
would raise (]uestions of convergence which will not be considered 
in this paper, However, it can be shown without much difficulty 
that the likelihood of the point whose coordinates are given by 

(11) IS greater than that of the initial jwint for variations of u 

and y about ? and rj respectively, and for •= f and v = f) 
the likelihooils are equal, Indeed the problem is equivalent to 
showing that tlie ratio of the likelihood (1 ) with the values , 
^ and for cr^; , cr^ and r to the likelihood with 

the values given by (11) for a. , cr^ and ‘r has a maximum of 
unity for variations of d and v about ? and fy , Thi^ can 
be readily done by examining, in the ordinary manner for maxima 
and minima, the first and second derivatives with respect to u 
and V of the ratio of these likelihoods. 

The matrix of limiting values of the sampling variances and 
covariances of the maximum likelihood estimates of and 

r can be obtained by taking the reciprocal form of the third order 
principal minor in the lower right hand corner of (3). This reci- 
procal matrix, is, , 

(12) 

JsE XsE ZsE 

r^a^a^d-r^) a ^ f r*)) ray(l-r^)ii*y,(l-r*)) 

' iie ’ TsE Zse 

r^U- r^xued-r*)) r<s^ 

ZaE ZsE ZsE 

where ( I- rV - 

* The amount of information in oo relative to , Oy and 
r is the reciprocal of the determinant of (12). Denoting this 
([uantity by 5 (/tv, ? 7 ^s), we have. , 


5, S' WIIKS 


177 


4 

(13) B(m,v,5)-- +rrtn3(l- 

Proceeding as we did with (7a), (7b) and (7c), we find the 
following incrernental contributions • 

{\Z\))3^^^(5-hl) (1-r^f \3s’^+3s-fl4-(Zs+lXmm)*mT)(l-r^, 

( 13c) B( m^^^, s)- B(m, n, 3)^ \zs^-i-5lmmlXl-r^ 

X y 

( 13d)j5( w, 77 , s)- B(0,0,s)^ <3^ 0^(l-r^B 

We note that the s*lsf member of co^y is much more im- 

portant than an additional item to each of u)^ and u)y when , 

(jy and r are considered, than when a and b are considered. The 

amount of information contributed relative to r by u)^ and ctjy 

can be found by differencing the reciprocal of the element in the 

lower right corner of (12), with respect to m and n. If we call 

this reciprocal 7^ j), we have as the ratio of the contribution 

of information by uj^ and w,, to the total information in oj re- 
, '* / 
garding r, 

(m,7i, s)-K^JC>Oy s) z s(m4‘ii)i-rmn(l- r 
Kfg -t-s ( rni-n)-i- mn (1- r'*) 

In a similar manner, we pan find the contributions of the various 
parts Cl) , oL and tJ to the information relative to any one 

xy x> y 

of the parameters and r and we can find their effects 

upon the covariances of the maximum lilcelihood e.stimates by con- 
sideting the non-diagonal elements of (12). We find tliat the in- 



178 


rOimUTfON PARAiMUrUKS 


formation afforded by relative to cr^ expressed in terms of 
the total amount of information in and uJ^ regarding 

' r'^sn . 

s’‘*s(m-i-'n)4Tnn(l~r^) 

We remark without going further, that, by considering the 
five equations obtained by equating each of the expressions in 
(2) to zero, we can find approximations for the maximum like- 
lihood estimates of a^b, ci^ , and r by the foregoing method 
Since the process is straightforward, though somewhat cumber- 
some in that it involves fifth order determinants, we shall net 
consider it here, 

III. Systems of independent estimates. 

We have seen that the problem of finding the maximum like- 
lihood estimates, of a,b, cr,, and r from the sample leads 
to expressions which are not very simple, especially from the point 
of view of practical application. However, the variances and co- 
variances of these estimates were found to be relatively simple. 
In view of the difficulties connected with the foregoing maximum 
likelihood estimates, we shall devote the remainder of 'this paper 
to a consideration of the moments, distributions and efficiencies of 
simiiler systems of estimates. 

If we are interested in the means of the *'s in u) ajiart from 
any contribution of the y’s, the optimum value of a is S^-r-r^ ■ 

-my I 

Similarly, for the means of the y's, we have . The 

best estimates of the variances O'J ' , and o’y* under these con- 
ditions are, 

where N, » s* m, and N^=s*rj. 

For the covariance we shall take the product moment of the 
deviations of the z'b and y’s in from 2^ and respective- 



i s n'lJ.K's 


IW V 


ly. That is, 

5,= 1 1 -V( y, -yj - ) , 

From these values, we can take as the estimate of the correlation 
coefficient, 

J ■ 




from 


I. Distributicm of and ^ 

The variances and correlation of and y' can be found 


-{Wv5 


' xJTF) 




a-a)^(y-bf Zr(x-a)(9-bj\ 


O^dy 


/C 


(yrb) 


by making the substitution , 57 ,,^^, and using 

determinant analysis’ on the symmetric matrix of the resulting 
quadratic exponential. 

The variances of and are found to be ^ and ^ 
respectively, and the correlation between and 
The exact distribution of and y, is normal. ‘ ^ 

The amount of informatidn relative to a and b furnished 
by 2^ and is, according to our definition, 


( 14 ) 


( S* m)( s* n) 


0^0^ 
X y 


-2«-« 


r 


(rrr-i-sX n-f-s) 


•] 


The efficiency of a;, and 34 is, therefore, the ratio of (14) 
to (7), which is 

sf(msfiJ- r ’^) 

[( m+^Kn+s)' m nr [f 


’Karl Pearson, loc cit. 
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2. Characteristic function of and . 

, The characteristic function or - generating function o! the 
moments of ^ and 3^ , which we shall denote by 
is defined as the mathematical expectation of t? , Since 

and \ are expressible in terms of ^ , y , 

^ , 17 and "S , whose distribution is given by (1), then clearly, 
we can write, 


(15) (p(r.S,£) = Je f7dV, 

Where F” is given by (1) and dl^is the product of the differentials 
of the variables in F, and the integration in taken over all possible 
values of the variables. 

The integral (15) can be broken into the product of a con- 
stant by a quadruple integr,al, a triple integral and two single ift* 
tegrals. The quadruple integral is of the form 

/ / / ^ df,di^dijCH^ 

-•w 

which has the value" - where Zl is the determinant | \ 

) and bij The triple integral is of the form 

/ ^ d%dxdy 

•'o •‘o -v5y 


which has the value” , , 

-dnr(¥)r(^) 


s-l 

-r 


"Karl Pearson, loc. cit. 

"See V, Romanovsky, On the moments of the standard deviations and of 
the correlation coefficient in samples from a normal population, Metron, vol. 
5, no. 4 (1925) pp. 3-46. 
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Each of the '-inRlo intcarals is of the well known form 

' C'^2' 


/' 


' dt 


1 

which has the value 

Usini? the above results for the intesials into which (15) 
resolves itself, we Ret, 


^ r ~ d 

\ 2. / A JL ' ^ ‘ ° 


nJ. 


^(aBcV ji} ^ 

^S-J 

x[( 4 - ^ )a- ^J-(C<£>’] 

/ r^m^n ^ tnn \ iC r/7??7 _ ,1'd /nvr 

'"IP ^ /V/ 


wl«ire /)-j^ . B-£r ■ 




r 


Za^a^(J-rV ’ 


If ue write 

we find the following expressions for llie first few moments of 

?. . ‘'"1'* 5,, 

M{ip,0)=^ ’ M(0,L,0}= Oy,' 

/ S-1 . m T7 \ 

M(O.OJ) ro^ Oy 1 “ sNp^z / , ■ 

i 

M(txoy 'i0p,ki,,iXN.-i)£zy(^*^-i\ 

\ 

{\7) H(1A1)- 
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)_^Va/,"-/; 


Mf^Ao)^ ^-L- ;^ :.. , Mfo,zA)=~~- 




M(0,0/) = 




(l+r^Xs-l)^r^ 


w 


PIT? 9 

^ ^ p<t 


Nr Nl N, N^ 


'(^ri 


M(W)- 


raJcTy [4^s-lXl*r^) , Amr? 
N,N, 


A mn / , mn , i 

4 \ 

/V, ( N,N^ J 


Zr / mn 


/42 *s.,pl^. Szl )/„ V V A/- N- )] 

' NfN^ J ^sN, Nj, 5 /i ^ ^ J 


If the sample ui is fairly large, we can neglect the contri- 
butions of the means a , ^ , 2,, and y, to \ and con- 

sider as satisfactory estimates of the variances and covariance, 


1-i-K ' 




-3 


3. Characteristic function and sampling distribution of 
Oc V 

It is clear that 5,, Oc 3>id are obtained from , rp^ and 
3^ by dropping the terms involving the means x , y , x, and 
Yi . The characteristic function ^(Y,6^£)o{ these statistics can 
be obtained from (IS) byreplacing ty, and S,, by and 

and integrating. The integral in this case will not involve the 
cpiadruple integral, but only the triple integral and the two single 
integrals, Accordingly, we find 

which IS somewhat simpler than (16). 

The first few moments of ip, and evaluated from 
^(r:S£) arc (using the notation of (17), 
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/V, 


Mmi)^ 


M(1X0> 


N,N^ 


2 r ( N, -Z)(N^ - Z ^ j 


MdAlh , MfOXlh 


18.3 


MS) , , a^(N,-Z) . a^t'^z-Z) 

' ^ MfZAO) = -^7^ MfOA 0) = ^StT— 




a/dyts-DfUr^s) 


MfWh 


(s-l)rcrjay 

~AN^ 


Z(N, * NzZ)*Z(s*t)r^(NrZ)(N^-z') 


In order to find the exact sanipliii" (li^trihution of f « 
and ^a, it is more convenient to consider the statistics , 

characteristic function of these sta- 
tistics is found from 'A(‘Z’,£,Z) by replacing ^V by if^ .^efby 
8^ , and f by <f, . Thus, we have 


(19) ip, (r,,6,. ^(A,B,-C^Ur ' 


where A,, B, y A, s 3, and C, arc the constants A ,3 , A ,B 
and C each multiplied bys, and a» and c« , 

The distribution /■( 5) .fPi.S, )of , rp^ and 3^ is then, the solu- 
tion of the integral equation, 


, P c 



1H4 


POI‘l'l..tlIO\’ P IK-I,\jn'l EPS 


\Vi' note from tlio fju'tor (A,-f/ ) can bo 

wiittiMi as fil<ewise with respect 

For sufficiently Miiatl values of and S, these terms can 
1)0 umiosonU'il by seiios o\pansions. It will lie convenient to le- 
arran'^e the I'rorluct of these two series in n power scries in r*. 
Expaiulinij: and airan.i'ins; in this imnuer we get 


r// i3r<r, 


Each U'lm of tins expansion is of the form 

( 22 ) 

where is a constant independent of and <f, . 

W'c are now in position to find f( ),is a series of 

terms f,, >7^, ^i) whose form is given as the solution nl 

The integral eiination ( 2 . 1 ) can he solved by the methods used by 
Ronianovskv"' Following Uomaiiov.sky we find that 






an even (unction o(t;^Siv-0 say, which satis 

fies the condititm 


"'V Romaiiavsky, loc. cit. 
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(25) 


’i-1 


r(g^i)r(c^g) 

r(Q^^ci-g)r(b/c+q)r(^)r(c) 


iox q ‘^Oj, 2, • • • and u) (t) is independent of g . 

To solve (25) v^re observe that the right side can be written 
as 


(26) 

C t 




t>u-l 


dudv , 


/ 

where//* 7q^jr(cjr(EiJr^2^ • The^-th moment of f^is now 
identical with the^-th moment of the product uv. Since cJ(t) 
is even, we have, 


(27) jt^^^oiWdi^-LMq . 


Setting V - , ofv' =~di in (26) we find 

(28) (d(i} (1-u) ‘da 

Jt2 


Making the transformation frl^e =G,'nt finally obtain. 


(29) 


a-t^) 




r(i)r(c)r(a.*b^fc-i} 




where the/" function is the ordinary hypergeometric series. Us- 
ing this form with t replaced in (24), we have S^) 

fully determined. ' ' 

The complete solution f ( ), (20) ca” ^ found 

by summing all of the expressions of the form (24) whose char- 
acteristic functions appear in the sum (21). Without much dif- 
ficulty ive can sum this series'by ^pressing the coefficients as beta 
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functions and interchanging the order of summation and inte- 
gration. Accordingly, we can express ^/) in closed 

form as 

, ^ b+c-l z a+bic-l 

m r, 0, (1-h) 






f' r' a-tc-f b-t ^2 ,-<7 , a 

/ jd^x) X [Wl-f-)4 


0 c 

X e 


' TP/ 


y»here 


K= B, ^ (A, B, -O' 


r(i )r(a)r(b)r(c)r(c-^) 


The distribution of f)^ and can be found by the 
change of variables T/ = j 

that our estimate oi ti’® correlation coefficient can 

range in value from . yv^W^ to • 

^ 4. Moments of f)^ and 5 ,, when r=0. 

The general product moment A'/ ..H ob- 
tained from (30) for r^O is extremely unmanageable and im- 
practical, since it is a generalized hypergeometric series expressed 
by a quadruple summation. However, for r=0,M(h,k,t) is quite 
simple. Indeed, for this case, we' have, 

(31 ) M(h, k, {) == i 

, b^c.R- i ^ ja^bi-c-i 

•>, y, /I 

. 9^ b-Jr “H/j 


Mfh, k, t) exists for all positive values of h and k and for all 
■positive -integral values of /. Since the integrand is an even 



function of it follows that M(/?, k, f)’0{or t an odd integer. 
If we let set = (31) and make use of (25), we 

find, 


, . A-h-TJa~k~-0 


(32) 


Njn^ 

rtbH:^-M)rta+c*h+v')rt\/+^)r (c-tl/) 


' rfc) r(i ) ' 

The2i/-th moment of the correlation coefficient can be found 
from (32) by letting Thus, 

52 / r(a+c-t ■k)r{b*c*i/)r(c)r(j^ ) 

The variance of is ’ which does not dif- 

fer appreciably from ^ , which is the sampling variance of r 
when it is computed from uJ^y . The distribution of r„ is found 
by setting in (29) and multiplying by rta*c) 

r(b+c). ' ^ 

The 2r/-th moments of the regression coefficient of y on 
a;, p • |p say, is 

and the variance is 

. g-/ 

(k,-zXNr4)s^ ' 

t cr* 

This differs very little from the variance of. the re- 

, X/ 

gression coefficient using only the data from cJ^y . 

Slightly more accurate estimates can be obtained ^ < 
Oy and rcTy^Oy by multiplying 5, and hy 
and 5 ^ respectively. These corrected estimates will have their 
mathematical expectations identical with orf , a* and r£^ dy , 
as will be seen from M(i,0,0), M(0,l,0)^xA H (0,0,1) in (18), 
In this case, the general moment M( h,k,Z'\J)'^'^ be identical with 
(32) multiplied by 

//V) jY s 
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' j 

The variance for in this case is jzj , and that for the re- 
gression coefficient is 

(NrZ)a^ 

5. Variances and covariances of and in 

large samples. 

( 

As we have seen in the last section, the product moments of 
, r)^ and evaluated from (30) are too complicated to be of 
much practical value, and there is not much hope from this source 
of finding the sampling variance of the estimate of the correla- 
tion coefficient The moments and variances of and 7^ 
taken separately are well known results. In fact, for large sam- 
ples, the variances are and ^ respectively. The variance 
of is not so immediately obtained. We shall find its limiting? 
form tor large samples from the normal form approached by 
the distribution of and to as 77 and s approach 

in constant ratios ^=<<. . 

For convenience let ^ and = t . Then 

we have 

If we integrate (1) with respect to 5 , y , and 5 ? and 
perform the following transformations on the remaining part of 
the distribution, 

;f 1 ^ V 

,, d.rf^Z,<P(Ul3)d(fi 

dS-e<f.dt, 


we can write it in the form, 



s. s. wnjcs 


I8<3 

(34 ) F(u,v,e^4>,t) ^/'u K fj\ \ 


where 


F(u, 4 C(U^XFm~F) "e ' 


m-j’Ti-tS.s 




:in(l 

C = 


77 ¥cr-^-'""V 

^ y 


nnd 

j:, , / d'i 

i(a,if,6^4\ i) ^(l-F) e ^ u 


z 

V 


z 


X [("/^^ uXl+dp V-/j-5" % V 1 ^ 

I 

J nT^ig^-r'^Ha Zrt6<P' \ 

X e'i(l-r^)\_ J. 


If (34) were integrated with respect to u and w, we would 
;'et the distnlaition of 

> wheie the distribution of , 77 ^ and ? is given by (30). The 
■■ prolilem of finding the asymptotic normal form of the distribution 
of e? , ^ and t from (30) seems extremely complicated; How- 
i'\er, we can find this asymptotic form by first finding the limiting 
I'oimal form of the distribution of u. v, 6 ,tp , and t from (34) 
and then integrating with lespcct to u andi/. 

Tlie limiting normal form of (34) can lie found By methods 
developed by vcm Mises” in a paper which appeared in 1919, In 
fact, satisfies all of the conditions of the generaliza,- 

tion of his first. theorem to functions of more than one variable. 
In ])articular, the first order partial derivatives vanish and the 

OR. voii Mises, Pundamcntalsatze tier Wahrschemlichkeitsrechnung'i 
Mathematische Zeitschrift, Bd 4 (1919) S. 14-18. 
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determinant and all of its principal minors of the negative of the 
Hessian are ])ositive at the ])oint P whose coordinates are u- = o^^ 
y=<^y, Furthermore, is identically zero 

outside the region of possible values of v , 0 , (p , and t. 

The matrix of the negative of the second derivatives at P is 



if 

if 

bf 

bf 

bf 


da 

dv 

be 

w 

di 

bt 



m 

^/Sr-’ 

A r 

6a 

mu 



bf 

rU/s 




jSr 





if 

be 

(^*%) 



■^^ypS-t) 

-ic. (1^±. ) 

ayP>' l2*P‘/ 

bi 

bp 


'<r^ ^ ^ 

-xF ' // 

OyOyP^ 


ayp^(x p^J 

bf 


tSr 

a^p>{z*P’J 


JiF 

P* 

bi 

app* 

O-yP 


Now it follows at once from von -Mises’ theorem that 






wiiere x^^v-cr^, . cp-a^ 

and hij is the element in the i -th row and j -th 
matrix (35) Now, 


r)> 


(z rr)i<xpcxp (t-r -f ; i 


and Xy^rt-r , 
column of the 
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which ih equal to ^ , where/? is the cletermmant \hij\ . 

The variables in which we are primarily interested are 0 , 
(f ami t . The matrix of \ariances and covaiiances of 0 , ^ 
and i is formed by taking the third order matrix in the lower 
right corner of the reciprocal form of 
turns out to be 

(37) . 


/? 


ij 


This matrix 




erf 


raJl+^-r^) 

zsW^ 

Zs(U<)(U/3) 



^y' 

roy(l+4-r^) 

Zs (i+<*,Hl+i3) 

Z 3(1-1-^) 

2s{/yci)( l+fi) 


ra-yf'jM-r^) 

s{Uk)(U/5) 


Zs(Uc(,)(U/5) 


&\ 


The determinant of (37') is . 
cr/cr/ \(l-r +i3)(l‘r ^)*<0(UrV\ 


The variance of is given by the clement in the lower right 
corner of (37). It can be readily shown that this variance is 
greater than , the variance of the estimate of the correla- 
tion coefficient from only — a rather surprising result,, 

The efficiency oi Q ,(/) and t taken jointly is the ratio of 
the reciprocal of the determinant of (37) to B(m,n,s) in (13). 
That is, 


{Z9)Eff(e,^,t)-- 7= (^+0) T^ll' 

[(I+’i )(Uff)-45r^ \(i- r (-i 

which is less than unity except for the cases and . 

6. Efficiency of the system d , <p and . 

If we use ^ , say, which is the maximum likeh- 
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hood estimate of r from instead of in section 5, and use 
the foregoing analysis of von Alises, we find the following matrix 
of variances and covaiiances for the asymptotic normal distrihu- 
<tion oi Q 'i <p and T/ : 

(40) 

6 <p n 


e 


(p 


n 


(y^ 




Zs(U^)(U/3) 

Zs(l4<Z) 


T-® 

rOyfl r^) 


Zs (U/S) 

zs(l4/3) 

ra^ (J-r^) 

ray(J-r^) 

(1-r^)^ 

zs(ippy 

ZS 



s 


The determinant of this matrix is 


(41) 




whose reciprocal provides us with the amount of information re- 
lative to and r yielded by the estimates G, (P and . 

The efficiency of this system of estimates is given hy the ratio of 
the reciprocal of (41) to (13). that is, 




IJy comparing the .systems & yP , and G yp » /7 , we actu- 
ally find the latter to be more efficient, since 




which is the ratio of the reciprocal of (38) to that of (4l), The 
equality holds ony when r^O ox 
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The distribution ffz,wJJoi can be 

readily found from the distribution of f , 77 , T , and i/, which 
is included in (1), by making the following sets of transforma- 
tions in succession, 


(a) 

d?^Jfddi, 

’ 

df’‘ 

(b) 

1 


dr^-l+ddw 


! 

dU‘ - zdB 

w(l-(P} dv* ~ ^ Wd(p.^ 

The result can be expressed in closed form as the definite 
integral 



S-4 sirrT-4 S-H1-4 

( 42 ) i(ii,w,thK(Hn 

/ ; ^ mil ^1 ^ 

^ [e'^ 6 ^ (l-d) 0 ( 1^0 dddip. 


N,-2 dfd. A 'It 

gy a-r^) ^ 

r{{)r{^)r(^)rmr(^) ' 


I 
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Zri )/N,N^e0'\IA 




When r^O , (42) breaks into the ])n)diict of thiec uell 
known functions, two of which represent the distributions of the 
variances in samples having stm-Z and sin-Z degrees of free- 
dom, and the third which is the distribution of the correlation co- 
efficient in samples of s items from a normal population in which 
the correlation is zero. 

IV. Summary . 

Samples are considered from a bivariate normal population 
of X and y in which all of the members are not observed with 
respect to both x and y . Such a sample is broken into three parts 

, 0 ^ and u)y , where is the set of $ members observe^ 
with respect to both x and y, lo^i, the set of m members observed 
with respect to x only and the remaining items observed with 
respect to y only. 

Maximum likelihood estimates arg found for the following 
sets of conditions: 

(a) For given valiies andr, optimum estimates 

are found for the means a and b . 

(b) For given values Q{a,b and/’, optimum estimates are 
found for and cfy, 

(c) For given values of a and i>, approximations are found 
for the optimum estimates of , cTy and r. 

Other sets of estimates considered are: 

, (1 ) Means a and b estimated independently from the r’s and 
the /s respectively, of the sample u) . 
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' (2) Maximum likelihood estimates of from cJ^y&nd cu^ 

and cXy from and u)y, each estimated independent- 
, ly of the other. The estimate of < 7 ^ is taken as the 
covariance from uj^y. The characteristic function of 
these estimates is found. 

(3) Estimates of < 7 ^ and <3^ taken as the square root of the 
weighted averages of the variances from uj^y and , 
and from Uj^y and u}y respectively, with the estimate of 
r taken as the ratio of the covariance of ui^ytQ the pro- 
duct of these estimates of the standard deviations. 

(4) Estimates of cr^ and the same as in (3), with r 
estimated entirely from u)^ 

The exact forms of the sampling distributions of the systems 
in (3) and (4) are found, as well as the asymptotic normal forms 
approached by these exact distributions as the size of the sample 
u) increases, subject to the condition that^»«( and are 
constant. The limiting value of the variance of the estimate of r 
in (4) was found to be less than that of r in (3). 

We have defined the amount of information available in a 
sample relative to any set of population parameters as the reci- 
procal of the determinant of the matrix of the limiting values, for 
large samples, of the variances and covariances of the maximum 
likelihood estimates of these parameters. It is shown that this 
determinant is smaller than that obtained from the asymptotic 
normal form approached by any other set of estimates of the 
same set of parameters. The amount of information relative to 
the parameters utilized by any other set of estimates is the reci- 
procal of the determinant of the matrix of the limiting values of 
the variances and covariances of this set of estimates. The meas- 
ure of the efficiency of any set of estimates is taken as the ratio 
of the amount of information yielded by this set to the amount 
yielded by the maximum likelihood estimates. The efficiency thus 
defined was found for each of the sets of estimates (1), (3) and 
(4). It was found that the set (4) is more efficient than set (3)i 



ON THE SAMPLING DISTRIBUTION OF THE 
MULTIPLE CORRELATION COEFFICIENT 

Rv S. S. WaKS* 

The problem of finding the distribution of the multiple cor- 
relation coefficient in samples from a normal population with a 
non-zero multiple correlation coefficient was solved in 1928 by 
Fisher’' by the application of geometrical methbds. In his 
derivation he used the facts that the population value p of the 
multiple correlation coefficient is invariant under linear trans- 
formations of the independent variates, and that the distribution 
of the multiple correlation coefficient is independent of all popu- 
lation parameters except p . 

In this paper it will be shown that the distribution of the 
multiple correlation coefficient can be derived directly from 
Wishart’s’ generalized product moment distribution without 
.making use of geometrical notions and the property bf the in-'' 
variance of p under linear transformations of the independent 
variates. Furthermore, it will not be necessary to show that the 
distribution will be independent of all population parameters 
except p 

The population value of the multiple correlation coefficient 
between a variate x, and a set of variates x, , x . it is 
the ordinary correlation coefficient between ar, and that linear 
function of the variates which will make this 

correlation a maximum. It can be expressed as ^ 

where A is the determinant of the correlations among all of the 

^National Research Fellow in Mathetnaties. 

R. A Fisher, The g;eneral saniplmg distribution of the multiple cqr- 
relation coelhcR’iil, Proceedings of the Royal Society of London, senes A. 
vol. J21 (1928), pp, 654-73, 

John VVisImrt, The generalized pioduct moment distribution' in samples 
trom a normal multivariate population, Biometrika, vol. 20A (1928) pp 
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variates \ A, is the determinant of correla- 
tions among the independent variates '^n i-'^'ioting the 

sample valneofjO*by/?^it is well known that /P=i-^ . where 

D andi^ are the determinants of sample correlations among the 
bets of variates and respectively, 

Let us suppose a sample of N items to be drawn at random 
from the normal t? -variate population whose distribution is 


( 1 ) 




where A-.. = t ■ A - 

V a^ojA 


P. 


t'j 


the determinant of correlations 


among the n variates, A^j is the cofactor of |0^yin A , <3; is the 
standard deviation of and A = j/l^y 


In the sample, let 


and 


"“/V ’ 




where is the value of for < -th individual of the 
sample, ^Visharf* has proved that the simultaneous distribution 
function of the set |» ( • • • n^is 
n(N~l) M 

(_i) ^ A ^ 


(2) f(aP 






Ya 


’) Wisliart, Inq. Cit, 
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where o;; is the determinant of the a's. 

I LJ I 

We shall define a moment-generating function 


(3) 




where the integration is to be taken over the field of all possible 
values of the a's and|< 2 ^^| is the cofactor oia,f in 

From this definition of is clear that 0(^,k) 


V 




IS the product moment L a (1-K J 


iJt will be shown that this 


expectation exists for /7=-A'which will yield the /r-//7 moment 
of (1-1?^) , from which the distribution of F can be found. 

To find observe that since' (2) is a probability 

function, its value over the field of all pos.sible values of the CL'S 
is unity, Hence, we must have 


(4) 



' N-n-Z 

a A da^O, 


where G - ' 


V 


^r(¥)r(¥h 


n/fi-D 




This relation 


holds for all positive values of N??! and for all values of Aij 
which will make the matrix || A/y | ^positive definite, 

If f(a) he integrated with respect to a,, , < 3,2 > ‘ ' °in > 
the resulting form will clearly be the distribution of. the set of 
a 's contained in I I and will be 
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r 5 ) 


ttl 

(1) b" 






where ^pq is the element in the p-th row and q- ih col- 
umn of the reciprocal form'* of the determinant which is the 
CO factor of the term in ^he first row and first column of the 
reciprocal form of j/d^y | . The value of Bp^ in terms of correla- 


tion coefficients and standard deviations is 


0) 


0) 


A " A,, , and A is the cofactor of 
more, Hence 


Ppq 


in A, 


where 


Further- 


d 






\a 


V 


da,d]a-a'] 


( 6 ) 


IM ±l -&L 


n-J ddi 


m. 


‘fX7\ 


cAp-d!i, 


where da/ « da^dai^ • ■ da./^ and d]^-a\ is the product 

of the differentials of all <3 's in joyc?^! 

Xow. it is clear that (6) is an identity for all values of 
and the population parameters cf and ••• rn] 

i^J) , for which both sides of (6) exist. Thus, we can per- 
form the following operations on (6) : 

(a) . Replace N by N*Zk . 

l N*Zk , , „ , 

(b) . Replace cr^. by a^i~W~* ( ‘■‘liZp - • nj 


*By the reciprocal form of a determinant jc^yl we mean the deter* 
ainant formed by replacing each ejenjent 9,, by the ratio where 


rainant formed by replacing each element c,. 
C, I is the cofactor of c, , and C‘\<^uV 


'•■1 
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(c). Replace by 

(cl). Multiply both sides of fhe identity by 

1 , -X" * 

(e). Multiply both sides by \apq\ . 

Accordingly, we find that the integral of the left side of (6) 
over all possible values of the a's is the definition of iPA,k), 
which must be equal to the integral of the right side over the 
field of all possible values of the a's in But the value of 

the integral of the right side cah be deduced at once from (4), 
Hertce, we finally obtain, 

-k N-1 ^ p.f£^ + k) 

( 7 ) ^ - r(k tM.) ' ’ 


where /\^is the determinant 4 with 4^^ replaced by 4,,-^ , 

and is the reciprocal of the cofactor of the element in the first 
row and first column of the reciprocal form of 4.,^ . 

That is. 




where is the cofactor of the element in the p-ih row and 

' q-ih column of , f p,q = Z,3,<- ■ n) The value of 

can be readily found by writing 

' I L 



where -4^^ except for l=j=1 and A^n . 

Increasing |4__^^^|to an n order determinam Dy inserting, 
as first row and first column, an additional row and column which 
will not change the value of the determinant, and multiplying it 
l^of-r^lwe find 

4 ,^^^ = 4 ^ 

t ’ / • ' 
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Therefore, 



Substituting this for 3^ in (7) and using the fact that 
we finally obtain 



Tims, it is evident that exists for sufficiently small 

values of Let us write ^ 

w .At-/ 


m.r) 




1 - 


(^-J\ 


and expand the second factor on the right into a Taylor series, 
Substituting in (9), we have the convergent series 


rf^) 


( 10 ) 


z 

A/-/ 


X L 
ioO 






For the coefficient of in the expansion of the right side of 
(10) in powers of .^.we find 


/NA., ) y V-Tt: 

I z / r(j^) 


( 11 ) 


i^O 





202 MULTIPLB CORRBUTION COBFPICIBNT 

which is the definition o( (1-^V ], We observe 

that (11) exists for all values of k and h for which 

.and Placing /?=-/: 

and pointing out that » we have as the k-th 

moment of 




(12) 


r {^) r (^) V ^^^0 




By using the relation 

. 1 

we can write (12) in the form 




e 


dd 


(13) 


rr-T^ri^) 


I [ 

I-O L 






'jrpjnnTrj 


dd. 


•The series in (13) is uniformly convergent in Q for OiOil 
and therefore, we can interchange the order of summation and 
integration and write 

(14) E [(1-F^) » ffl-e) ^ 4 
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where 


(15) 
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Thus, we have a distribution function of a variable & such 
that the moment of & is identical with the /r-M moment of 
for all positive values of ^ . It follows from Stekloff’s® 
theory of closure that jZ^/<^!imust be the only continuous solution 
of (14), where f is defined as (12). Therefore, 

the distribution of P is identical with that of 6 and can be 
written finally as 


,, r(V} 

(16) 


which is the distribution found by Fisher except that he uses the 
notation 77 ^ = 77 -/, the number of independent variates, and 
Nf the sample number. 


‘W. Stekloff : , Quelques applications nouvelles de la theorie de ffrme- 
ture au probleme de representation approchec des functjones et au probleme 
des moments, Memoire de I’Academie Imperial des Sciences de St. Peters- 
burg, vol. 32, no. 4, (1914). 

Z<J 



CURVE APPROXIMATION BY MEANS OF 
FUNCTIONS ANALOGOUS TO THE 
HERMITE POLYNOMIALS. 

By Herrick B. H. Greeveeaf 
I. Introduction 

I In an article by J. P. Gram entitled “Ueber die Entwickelung 
reeler Functionem in Reihen mittelst der Methode der kleinsten 
Quadrate”^ a unique procedure is set forth which leads to a very 
great simplification in the usual method of curve fitting by the 
method of least squares, That this method has not been given 
more consideration is probably due to lack of knowledge of its 
existence, rather than to lack of appreciation of its merit, Edward 
Condon, “ Raymond T, Birge and John D. Shea’ developed for- 
mulas by means of which curves can be fitted to certain types of 
data. Later, Harold T. tlavis and Voris V Latshaw’ developed 
specific formulas, with tables of coefficients, by means of which 
curves of the second to the seventh degree can be fitted to the 
, data with a minimum amount of computation. In a later paper, 
Professor Davis® has employed Gram’s method and in this way 
has developed a set of functions analagous to the Legendre poly- 
nbminals. 

The purposes of the present paper are : 

(1) To -develop formulas for fitting curves of the second 
to the sixth degree to given data by the method of least squares 
where the r?+l frequencies of the data have the terms of the 
. expansion of (j -t--) weighting factors. 

journal fur Mathematik, Vol. 94, 1894, pp. 41-73, especially pp. 42-46. 

*"The Rapid Fitting of a Certain Class of Empirical Formulae by the 
Method of l.east Squares.” Univ. of California Pub. in Math. Vol. 2/ 
No. 4, p(k 55-66, March 1927. 

‘‘ A Rapid Method for Calculating the Least Squares Solution of a 
Polynomial of any Degree.” University of California Publications in 
Mathematics. Vol. 2, No. 5, pp. 67-118, March 19^. 

^“Formulas for the Fitting of Polynomials to Data by the method of 
Least Squares." Annals of Mathematics, Second Series, Vol. 31, No. 1,^ 
Jam 1930, pp. 52-78. , . ' 



H. E. H. GREENLBAP 


20S 


(2) To develop by Gram’s method a set of functions 
analogous to the Hermite polynomials, by means of which curves 
of the second to the eighth degree can be fitted to data under the 
same conditions as in (1). 

(3) To study the properties of these functions, finding a 
generating function, a recurrence formula, a second order differ’ 
ence equation, and giving other methods for deriving them. 

(4) To apply the functions in finding a curve to fit given 

data 

(5) To furnish tables to facilitate rapid calculation of the 
coefficients of the required equation. 

II. Development By Ordinary Method of Least Squares 

Suppose we have given data in which the variates, x , are 
equally spaced, the observations being weighted with the binomial 
coefficients, having the origin at the mean. Thus, let the given 
data be 


X x.p ' x:./ Xf ■ ■ • • 
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data having an odd number of classmarks. If there should be 
:an eiven number^the data can be modified by leaving out one of 
f Ihe end variates, or by adding an item by extrapolation. It is 
also possible to transform these functions by moving the origin 
to the extreme left, thus having x vary from o to 77 . If this 
is done, the functions in part 3 of this paper will reduce to those 
generated by equation (18) in Gram’s article. 

It is required to find the coefficients in the equation 


( 1 ) 







such 

P 


that 


L C fy .y'/=E C \y -a - 




n^T) 


”]■ 


shall be a minimum. 

In this section, the ordinary method of least squares is used, 
leading to the n+l equations 


M, = 

( 2 ) 

" ^n,o^2 -^^n.Z % ^ 4 

where 

(3) N„=£ and 

X--P )^--P ^ 


(It is evident from the symmetry of the distribution that all 
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moments, with r odd will be identically zero.) 

These mJ equations can he solved more readily by dividing 
them into two sets, one containing the coeflficients of subscripts 

containing the subscripts o.^zr+1 ■ 

Thus we get 

: 

( 4 ) Mz 2^4 ^n. 4^ 


^4 " 0^4^ ‘^r7,2 ^6 ^ 4^0-^ 


and 


^ 3 ^4 ^ ^n, 5 ^6 * 


(4a) /Wj ^ ' •’ ' 

^5 ’ 1^6 ^n, 3^3'^ ^r>, 5 ^10 ' 

The computation of the moments, , may be accomplished 
in the following manner; 

Up)! 1 


(5) 


. {p+x)l(p-x)l 2^^ 


is the 


general term in the expansion of 

(Zp)i I ( P4x)!(p-x)! fP^PxX 

Cv ( p+X 4 -l)!(p-X-l)! (Zp)l P + X 4 


( 6 ) . ' . ('p444l) = ('p-4)Cj^ . 


Multiplying each side of (6) by (x-tl) and summing from 
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p \a x-=- p .we get 


E C .( p+Xll)(xil) ( p-x)(xi-l)^C 

. r\ X-r J. _ 


or 


-P 


-P 

~P ~ 'P 

^ k-Z 


P p P k+l ^ k P k 

C /.C pfki^l) C^-T,^x{xd)C^ 




] 




By virtue of (3), this Iiecomes 




-- p[r77^ ■ • ■ ] 

(7) ■ ■ ]' 


Combining the terms in 777 ^ . and recalling that all odd 
moments are identically zero, we reduce equation (7) to 


( 8 ) 

Zm 


k*i 




By means of this recurrence relationship, the moments are 
found to be 
m=l 


n,,B 


P 

T 
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% 16 ^4 4 ^ Z 

^ 2^5 p p 

%" iz e a 4 z 

lomp<‘ >^ Si97Sp'» , „> 

-« 64 -}Z 16 ^ 

WZJp^ ^\ p 
4 ' Z 

iysi3Jp^^^ $4y94yp^'^^ 94 ^ 949 p 

4^,1 — 7T7j — r3 ^ *7207yp 

128 64 16 

194199 4094 P 
8 4 2 


where p7^1^ p{p-l)( p-Z) • • • •(p~p4l). 
When expanded, the values are 

777^. L 


<4 4 

pa9p^^l9p4 4) 

Q 

p(10}p^-ZlOp^^l47p -y4) 

rr,^ _ 

Pf949p'*- 3190 p 4099 'p^~ Z370p *496) 

32 

^ pa0399p^- ^1973 p^* 107419 p^- 111709 p* 

— 

36998 p- 11096 ) 

' ^ 64 
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m PfWD^p ^■~94m5p 'p zdjye^^p-^- 4 J 79 W 

"V4 = . -J^ — 

4114110 p 1911000 p 

^ /za 


Knowing the moments, it is now possible, by explicit cal- 
culation, to find the coefficients, a , in (1) for special cases. 

//; *■ 

Case L Linear , 


(4) and (4a) both reduce to single equations 





(9) (a) 

M 

° 

(b) 





where A 

Case II. 

Quadratic. 


y = (a 

Z,o ^Z, 1 ^ 


* This leads to 

the two sets 

of equations, 


(a) 

and 

(b) ^ . 

(Notice that one set of equations in e?ich case is identical 
wifh a set in the preceding case. Therefore, only one of the two 
sets needs to be solved.) 



(10) 


fr. B. H. GRBBNLBAP 
(b) is identical with (b) in (9), 

j - a, A , M. . 

z,d 1,1 11 

Solving (a), we have 
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^2,0 * 

^2,2 ^2 > 




~ pfZp-1) 


Case III. Cubic. 

The resulting equations arc 


(a) 


>^2^3,2 > 


(b) 




/^ 3 = 
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cij^i ^ A^M, * B^M^, 
^j,3 = ^ ^3 ^3 > 


. , Z(l5p^^l^p + A) 

where ^ 


4- 


-40P-1) 


^ <3 

C, = — T- > 

^ <3 

ds ^3p(p-J){Zp-l). 

Case IV. Quartk. 

The coefficients with the second subscript odd, are identical 
with those in case III. The.,other coefficients are found from the 
equations 

, ^0^ ’^0^4,0+ ^z<=^4,z + ^4 ^^4, 4 > 

^z°-4,0*'^4^4,z ->■ m6‘^4^4 . 

^4 ^4,0 3- rr?^ ^ . 


• ^ 4.1 ’ ^ 3.1 


(X ^ O 
4,3 °i ,3 . 


and 
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ll2) 


^ 4 ,Z ~ ^ 4 ^ 0 '*' ^4 ^ ^ 4^41 

^■•4. “ ^4 ^ ^4 ^2 ■*■ ^4 ^4 ) 


where 


4 « ll^P^-25p^6) , 
Zd'4 
-iO(p-l ) 


4 * - 
c - ~ 

^4~ d- 


< 


d'^.(Zp^I)(Zp-V. 


n» ^(24p^~39p4l7) 

1 : 

^ 8 ( 3 p-Z) 

V " d' 




8 


d.^’ipipdKZp-Wp-t 


Case V. Quiiitic. 

As jn the previous case, we have at once 


f,o 40 ' 

'^s.z' ^41' 

) 

V“ V' . 
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and 


a_jr j - mga^^^, 


M^~ *■ ’^/o < 

from which 


( 13 ) B^M, 4l)^ /% 

where 

. (f 25 p^~ ZlOOp^^ZQJ^p^- 1480 p 4276 ) 

■ ds 

„ -(ZOXZlp^-d^p^ 4 76p-IZ) . 

- — : — ^ ^ ' * 

- za^p^-z^p/t ) 
ds 

_ AOOZp’^-Zlp+ld) 

, 

^ ds- 

^ ■ ^80(p-I) . 

Rjf* T- ^ - ■’ » 

^ df = /^pfp- l)(p '2 Xzp- l)(2p - 3). 

Case VI. Sextic. 


As before, we have 


‘^a,j “ 

* 6,7 "^^ 3 ' 
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From the equations 



^0%0-^^Z^6,Z 


n- 



/%= 

'■ ^A%,C 






we obtain 



^^6%' 

(14) ' 





where 

A UOp^A I47P-J0) 

Zdl, 

o 7(I^P^-4 &p^Z 3) 

^0 

^ 14 (J P-1) 

'-M rr — ' 



4 ,'. i(zp-mp-i)(zp-i). 

P iszjOpK 119^ Zl910p - ^zib) 

<^6 

fr. Z0(nip\6dlp^*d4bp. 304 ), ‘ 
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l\h 


* 

IA~ 147p + U0 )^ 

^ r/ 

/ 

T Q0(^P-4) 


d^ = 4Jp{p-l)(p~Z)(Zp-l)(Zp-^XZp-^). 


Tallies for all, the coefficients. A,, A^, J(j , 

to ten significant figures for p from 1 to 20 will be found at 
the end of this pa))er 

Special attention is directed to the last coefficient in each case, 
^ , as reference will be made to it later, It is desirable to 

be able to compute this coefficient without having to solve a set 
of equations. 

Let represent the determinant 


'1 


(I5a) 








'^6 ' 

■ • z 


^6 

^<9 • ' 

• ^n+A 





J 
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for n an even integar, and 



% ■ ■ • • 



^<9 • • 

• • ^ 77^7 


^to ' ' \ 

•^ 77/7 

^•n+l 

^ 77.7 ^ 77/7 ' 

Z ?7 


for n odd. I'y definition 

Let 'P(r7 denote the 'iame determinants, the last column 

Ix'ing replaced by A^, or by 

Ml, Mj, Mg ' • according to whether r? is even or odd. 
t^imilarly, 'P(-n represent the oiiginiil determinants, the 

last cnlutnn being replace<l by • a:.' ^ or by 

X, (or )i even or odd respectively, 

It is clear from the normal equations in case r , that 

777^ 777^ Mg 

^2 


( 10 ) 


^r^r*Z • 




7772 • •• 

■ • 777/. 

777^ / ■ • ■ . 

• ■ 'Vt-tZ 

^r+Z ■ 

■^zr 


for T? even. 



218 


CURVE APPROXIMATION 



\^0 

^2 

^0 




M, 

Thus, for r*-4 




we have, a = - 

^4. 

^6 


IV w V 







^6 


% 

^6 
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6^4/ Z! (2p) 

” ‘^zTt^^^Tz) 


2 n 

Similarly, since ^ 




JUiUpf^^Up). 


2it(3-n) 

It is clear that for n even, or odd, 


(19) P, 




m) fn-Z) ln-4) 

n/(n-2)!(r>-4)l • M (Zp) /V ■ ■ ■ • ^ 

^ZCrt* n-Z + ) 


eacli scries ending with n-ns O or 1, for n even, or odd. , 
With the recurrence formula (17) or >the general formula 
(19) it is a simple matter to evaluate P (n) a list of which 
follows : 


^ _ pC2p~l) 

nzj '4~' 

7^ 53 ■■ , I I - ■ ■! \ 

YJ[) 16 

:^z ' 
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4? P ^fp-lffp-Z)(Zp-lf(Zp-J) 

Z^6 


( 20 ) 


^ D5p^tp^O^(p-Z)(2p-lf(Zp-3fUp-5) 

Pm — ' 

I6J84 

42. '^(p^l)^(p-2f(p-})Up-l) ^ 




^2,765 

f2p^?)^(2p-7f(2p-7) 

22;rbd 


' in. Development i’y Gram’s Method. 

Let the tariates. . the observations, . and the weights, 
, be given as before. VVe assume that there exists a set of 
functions 

of degrees 0, I, 2 p , respectively, each of the form 


( 21 ) b^-p b^x + bj^x^ . • ■ ■ 


and such that 





for , m7 n 
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the value of 5^ to be determiiie4 later. 

We wish to approximate by means of these functions a 
function of , 


123) y =y'<,- 





which will he fhe equation of a curve fitting the given data, 
Multiplying each side of (23) by (x) and summing 
fuiin x=~p to 4 -p , — applying (22),— we have 

( 24) Z : 

P - P 

Substitute m the left member of (24) the value of 
in (21) and we have i ^ 

P p P P 

l;>Jl b^L + bX 

•p ‘p < -p ~p 

which, by (3), becomes 


(25) 




b^Mg^biM^ ~hb^M^+ ■ ' • • 




This value of is identical with a found by the first 
method as may be shown in the 'following manner : 

'Let 

(26) , 

which is to be minimized .where • " 
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Taking the partial derivative of J with respect to , we have 


^ - -dcJ/Ay^- a, t U) . 

which reduces, by (22), to 




-0 


(27) 





or 



as in (24) 


We are, therefore, able to write the values of ~ in ('25) 

by comparing the coefficients of the moments, in (25) with 
those ina^ ^ in equations (9) to (14), or in (16). 

Thus, for r= 4, we have 




^ . t>,M, b,M, 

a= — — rr— -r — ■+ V ■+ — tj — / 




( 28 ) --.f 

4 " '' 


^ 4:4.- 


( 29 ) 
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Now the value of C w (z) 


p Zl-^r' 


can be 


found by comparing the coefficients of the expansions 

given by the two different methods. In case IV, we have 


■ ■ ■ -f- 


of lower degree \nz)C^ , 

so the desired coefficient is 
By (23), we have 

y^a^<t>^(z)C^+aj(p^(x)C^+ ■ • 


'4 " 4 


5 ^ 4 '■'4 4 


But by (28) ^ and the desired coefficient is 


^4^4 Z 


Equating these coefficients, we have 


(30) 

Therefore, 5^ is equal to the reciprocal of the coefficient 
of MAn - • But examination of (16) shows that this 
coefficient is 



224 


CURVn APPROXIMATION 


nn ' 5 = • 

The coefficient of x.^ \x\. (f>^( x) can now I)e found by 

'^(r) ^ 


multiplying the coefficient of in a 


r,r p 


(r^z) 


Indeed, it is possil^le to express (x) as the quotient of two 
determinants The coefficient of ;k;Mn will be identical 

with that of Mi in pjy We may, therefore,' write 


C 421 I ^ P fr,x y . 

^Z) 

Proi ceding in this manner, we obtain 


(xh x^- Op-Z)x^+ — - jf’ -- ' ! 

( 33 ) , 
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I4(p-z)x^ 

lOjlp^- :>Z5 74ZP-Z64 . 


The coefficient, ai , of /'xj is givei^ below. In addition 
to the values obtained from (9) to (14), and have been 

added. The values of j are given in 

the tables at the end of this paper. 

a, ■ , 










( 
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where 


„ d(W^p^-4Z0p^^441p~90) 

dj 

^ tb( 105 p^-Ji 5 p+ 

G u — ’ 


4 


224 ('3p- 4) 


and 


^ 7 " dy ’ 

4 = ^l^p{p-l)(p-2)(p^V(Zp4)(Zp-'^)iZp-)), 

z 






3 (Ip-l)iz p-V(2p-5)(Zp- 7} 

. - >6(10^ p^- fZ^p^-t- 74Zp-Z64 )_ 


“a 

IIZ(l5p^-4yp-^44) , 
^8 

~44d(p-Z) 

r^'t 


E --H 

ds ' 

dg = Jl^p(pd)(p-Z)(p-^)(Zp-l)(Zp-})(2p,-5)(2p2l}, 
It was shown above how the coefficients of x-^ in ^^(x) 

could be found from those of in . It is evident from 
that, that with (x) known, the value of can be immediately 
determined by changing to Mi and multiplying the result by 
f^r “ ^ shown later that these 's can 

be determined independent of the dkerminants previously used, 
and more easily, a^and <2^ were determined from <^7 (x) and, 
<t>Q (x } » respectively, and then checked by means of (16 ). 
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IV. Properties of the ( j ) Function, 

The similarity between the ^ functions just derived and the 
Hermite polynomials, of which these may be said to be the analog, 
is evident, and leads one Ui expect that there must lie a generating 
function analogous by means of which all of the 05 

can be found. Also, one would naturally expect to find a recur- 
rence formula and a second order difference equation analogous 
to the relationships existing between the Hermite polynonlials. 
This proves to be true. We have, in fact, 

(35) (~ C^(p+x.) where 

(p-l-X-)(p-hX-l)( p-i-X-Z)- ■ • • ( p^X~ 7?^ I), 

Expand the right member of (35) and divide both sides by 
i and we obtain 

r)(p-x:/’^'^^( P*x.) 

(36) 

A study of the expression in the bracket brings out the fol- 
lowing facts: 

(3?) The coefficient of is ('-Z)’^ , for it is seen to be 
equal to ' 

(■» -af-H) I'vryY/W' ■ • ■ ny-i) " 

V is even, only even powers of x appear in the 
expanded form of the bracket; for n odd, only odd powers of 
% will occur. 
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Consider the coefficients of etc. With 

ri even, the terms of odd degree in in { will be nega- 
tive, and cancel the corresponding terms in the last parenthesis, 

■ If n is odd, the factor (-/)^ causes the corre- 
sponding terms of even degree in the same two expansions to have ‘ 
opposite signs, and, therefore, to vanish. Similar reasoning holds 
for every pair of products in the bracket which have the equal 
coefficients, If is odd, this will include every 

term. If r? is even, the middle term will be 

^p-xj ^ ipi-x) ^ =(p’x)(p-t'x)(p-x-l)(p+x-lXp-x-2Xp+x-2j.,^ 

the odd powers ofp having zero coefficients. 

Therefore, either all terms are of even, or all are of odd 
degree. 

It is necessary first to prove that the relations (22) hold; 
that is, 


(a) for rr?/n. 





To prove the first relationship, we may proceed as follows ; 
Let 

P ' 



where We may assume without loss of generality that n> rn\ 
for if it is not, ^^^r^k^and can be interchanged. Using 
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the fovinula for finite integration, 


we Itave 


A. 


Itetween the limits -p and p*l 


Now (*»y 35) 


'(-i) ^ 


■(- 0 \ 


^r?-3) jy) . ,0^^^ 




■(-{)% ^ ^ ^ 

nV' (^p)L — .(p^.x)\f{<p^x>.(p^*^■^^^ 
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For the lower limit, -p , the factor pi-x =>0 .and for 




1 





P-t-i - 
-p 


Similarly, each of the succeeding terms up to and including 

the term ^ ^ becomes 

identically zero because of the factor or for the lower 
and upper limits, respectively. 

Since is of the w-Ji'/ir degree in X , fV/'and 

all higher differences are identically zero. 


( 41 ) 


for n. 


If 777-/7 , the first 777 terms vanish as in the preceding 
case. The (m-hl)th term is the only term left in the series. 

( 42 ) 


, . sTn . -m-J / 

^(-1) m! A 


But 






.(m)\ 


// )^-J dp)! 

' Z ^P ( piX’Hrn)l(p-ii'm)l 
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( 43 ) 


-f 

3 Zp-* m 


(p-fTi)/ (p~x~m) I 


It is now necessary to find a function, such that 


A u 


^ (P+x- )((p-x,-Tm)l 

Since A = ( E - 1) , we may write 

All ^ (E-1 )•* u = — > 

^ X (pty.)l(p~)i-m)! 


~1 


1 


I-E" (p-Lx)!(p-xf-m)! 




(p-LX)!(p-X- TT))! 


( 44 ) . . A 777 )! 


P*1 

-p 


P-lI 

-p 


1 _ i 1 

(p+x-)!(p^»-m)l (p-t-a-f-l)! ( p-x-m-1)! (pLx+2)l(p-x- m-2) 

between the limits -p and p-hl . Substitution of the 
upper limit, pi^I , makes every term zero, because (p-x m~k)!-oo. 


For K’l-p , the right member becomes 

V. i-. 

OiUp-m)! l!(2p-m-l)! 2l(^p-m-Z)! 

^ (Zp-m~l)f lZp-m-(2p-rn-l)),! ^ (^p-m)!Ol ' 

* 

all succeeding terms being zero because of the second factor. 

-i X 

' (p^x)! (p-x-mj! 
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« 

1 

/ 


(Zp- m)! 




■Zp-rr>y^. 




Up-m 



or 


(4S) 


^ Zp-m 

(zp- m)l 


Returning to (43) and then to (42), we have 



' (zp-m)l 


3 


m.HZp) 


by (22), 




^irp) 

^(rr)~2) 


on /• ’’rm.z, ■ See (17)] 

Therefore, £i^ Cxi satisfies both 

t 

condition^ of , (22). 

Recurrence Formula. 

- It is necessary to. note that 
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^ 2^p (p+;c)!(p-x)i 




p-*J^-l . 


(p+x-ml) 


('/3-fx) (-Zx-t^rt-J), 


* Cp-^x) n('-Z)A (p-i-x^I ) '"'V 

1 X^'^= (1 *a} C^( p-f-x) 

.'.A C^(p^■x) i f-Zx-f-p}A”C^fp/x/^^ 

-Z nA\{p-^x/”-Z nA'^'b^^pfx/’^'^ 

= - -2nAXrp^-)^^ 

^"2x4 ^CJp-f-z/^-pA t'p^x^/^ '\-2x+ r>- 1) 

- nA"''cjp-fx)^^*^(Zp^Zx-Zn-fZ) 

Now A ^C^(p-fx) (~z) (x)^ 
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(49) 4<p nf^ptl- . fx) = O. 


/ 7 -/ 


Difference Equation. 

To simplify the reductions later in this development, it is 
desirable to have the following identities: — 


(a) ACy-C^ 


(b) 


r— . -^1 




-ZX.-1 


p+x-^- / 
4x.^+3x+2-Zp 


X- X {p-f-x-t-2)(p^- x-i-l) 


(50) 


p-x 


“ ^X p + x-f-i ' 


, . ^ -Zx-i - p-x -Zx~3 

(clj ^x p^X4r p4X4z‘ 

(el /• = r rp-x)(p-x-l) 

^ ^p^:>:^2)(p4X4j) 

Let Cy^( p4x/"\ 

A^u^ = A ( p 4 x) (-Z)^Cy^ 4n 


Au.y=C^(p+x) ^(-Zx+n-l), (by 48) 

and, multiplying both sides by (p^x-n-t-l) , we get 

(.p+x-nxL) Auy. = (-Zx4n-l) u^. 

Difference this equation n+l times, having 

(p-t-x- rj'+ i)A Uy + fn^DA 


( 51 ) 


n*/ 


= (-Zx4 n-iM Uji,-Z (n-f-DA ’^u 


X4 / 
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or. since (UA)-^ , Ave have 

( 52 ) r 7 ^J 

~(-Z)i>4-v-l)A u^~Z(n+l)A U^-Z(ri4-I)A^U^ 
Combining like differences, we get 

T7'^2 

(p4-3i-i-Z)A u 4' (2xi+2t7-f-4)A u -)-z{n+})A u ^O, 
«*> 26 » 




+ 2 fr 7 't-J)(-Z) C^^^U)~ 0 . 


This may be simplified by making the following reductio is ; 

(p^Z4Z)(p4X4.1) ” ^ (p*X4Z)tp^Z4j) 

fp4X4-Z}( p-hx 4-j) 
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Substituting these values m (53)^ noticing that Cy. is a 
common factor, and that j can be made a factor by 

multiplying the last term by > we have 

fS4) ^ 

(p ■■x)(p-x-l)A 4„(x) 

■i^x^*dx*2-Zp*f^2x-i-2n^)(-Zx-l)-t-Z{rn-I)(p+x+J^ O, 

The coefficient of reduces to Zy)(p-xX and that of 

64>„fx), iQ (p-x)(2„-Z-Zx) . Dividing by 

we obtain the desired second order difference equation 

* ^ (p-x-l)A Cx)i-Z{ n-l~x)A O. 

Other Methods of Deriving These Functions, — 
3rd. Method. 

Equations (2) were divided into two groups, (4) and (4a), 
from which these functions were developed. It would be expected 
that the functions could be derived from (2). This is easily seen 
to be true. 

Let be the determinant formed by the coefficients of 
cj„ ,. in (2), 



% 

0 


0 

^4 • ' • • 

(56), Let 

\0 

'^z 

0 

% 

0 .... 



0 . 

* • 

0 

m(, .... 

• » • • • 


• - 

* 


• 

■ ' 

Zn 


I 
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for ri'>0 In the special case , we will define 


LetG 




0 0 


Q O 


rrig^ 0 0 


Then (x) is easily seen to be 


(57) 




I 


X. 


Explicitly, we will have 


' ^(o> 


1 

O X 






m 0 i 
a 

\0 X 

O x^ 


^ O 
\o m. 


*X 


2’^f P 

* - ■' ■ = * - » I 
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'iDq 0 i 

0 0 X 

0 yr)^ x^ 
0 0 x^ 




0 

0 0 
0 m^ 






m. 


X, 


and so o'!!. 

The coefficients of the ^ 's can be found from the formula 



where Q(i is , the last column being replaced with , 
, Thus, we have 
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rr?, 


Mo, 


% ^01 


0 M, 


L ^0^1 -i. M, ,x A u 


THo 0 \ Wg IT}^ 

^ % 


Trig 0 Mg 


0 w- M, 


fn^ 0 M: 


% ^ 


0 O 


Mz ^ ^4. 








M, 


’Zp^r'o p(2pA) 
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0 777 ^ ^ 

O 777, /% 

0 O 

1777, O 0\ 


777 ^ (m^m^- m^h n7 ^( - m^) 


O 0 777 J 


m^ O yn^ 0 
0 m 0 >n 

■4- b 


.e^>M,4^, 

j-^p^(^p-l)(p~l) 




This method becomes more difficult than the first liecause of 
the higher order determinants, and is, therefore, of less value in 
deriving the functions. 

A fourth method of developing these polynomials is to build 
up a set of orthogonal functions in the following manner 


‘This method is given in a thesis by Harry R. Mathias, entitled "Prop- 
erties of Orthogonal and Biorthogonal Functions from the Standwint of 
Integral ^nations, " written at Indiana University August, 1925. He cites 
as his refei;ence E,' Goursat — "Recherches sur les equations integrals 
lineaires, Ann. de la Fac. De Toulouse, t. 10, 2nd series, 1908, pp, 5-98, 
especially page 66. 
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Assume a set of functions 

It is required to find a set of functions, , such that 
P 

Z C 0 fx)(^^(x)^0, -for m-^rt, 

“P 

Let (^^(^)=^fa(x)'‘ J. 

We may then form the equations 

^ *0 , obtaining 



159 ) ^ 


p 

-P 

the solutions of which are 

a _ p(3p~l) 

a^’O, < 3 ^- ^ 4 * — — * 
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Let = Z’-. <pj^()c), 

d^(x) = x^- §■ , 


... ^ /:> (3/0-1)^ 
XT- ~ 


Form a set of equations, similar to (59), using the 0's 

p 

Z CJx^- §-b,x:>)x^O, 

-p ^ ' 


P 

ZCJx^~ b,x)p^O^ 

-p X ^ > 


~b^x)x^O. 

From these equations, we get 

■ 4-a bj=0. 


Let 

Similarly, the equations 

t 

z cjx I - c, (x^- ^ ; j (x^- f)=o, 
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lead to the values 


C^-Op-Z). 


Cl is easily seen to equal 0 , as the term independent of 
Cl is L (C^ multiplied by an odd function in x . Expanding 
the second equation, we get 


LCJx^-p 

-p z . 






p 1 



L. 

■p 

4‘ Ji 





^d'1^4 


and substituting the values of the moments, we reduce C2, 
to the value above, = (ip-Z) . 

Eet 

-Pp-z)U’-- ft 


This method may be continued to obtain as many of the ^ ‘s 
as desired. It is to be noted that X CxJ is (}) (x) . However, 
the proof of this would necessitate adding another function, 
fyCx) <‘X^ , at the beginning, and carrying out the process with 
another equation in each set. 
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Method 5. 

Grain's equation, number 18, is 

This gives ^‘'s for the origin at the left end of the distribu- 
tion, the variates running from d? to r? . Let this equation be 
transformed as, follows: 

' Let m= n, 

%-s.ic^4-p , the primes being dropped, 

ri=2p 

t 

If the values of r?BO, 1,2^ be substituted, 

we will have = (x) required func- 

tions are found again. 

« ^ 

VI. Example. 

As an example to illustrate the use of this development, I 
have chosen one used by Karl Pearson,- the data of which he 
attributes to 'f. N. Thiele,® and are the frequencies in a game of 
“patience.’’ On page 295, Vol. I, Pearson states, “Now either of 
the curves in Illustration I and II is a good example of the im- 
possibility of using the method of least squares for systematic 
curve fitting.” (The set of data below is his illustration 11.) 

In volume 2, using the Method of Moments, he fits curves of 

'Gram uses the notation 77 instead < of. and ior(^). 

*On the Systematic Fitting of Curves to Observations and measure- 
- ments. Biometrika, vol. 1, (1902), pp. 26S-303; vol. (1903) pp/1-27, 
(1889) Almindelig lagttagelseslaere, Copenhagen,. 
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tlie second to the .sixth des^rce to this data. Noting that his state- 
ineiu, page 18, "taking the sixth paiahola a.s the*best fit" is correct, 
it U found that the .siiin of the sciiiarcs of the errors is more than 
1400. ] li.s results for the skew frequency curve gives as the sum 
of the ,'ajuau's of the errors, 782, which indicates the second curve 
to lie a more acciiiate fit. 

Ja'I the given data be 


Value of Chainctei 

3 6 7 8 9 

10 11 12 13 14 IS 16 17 18 19 

Fi eniK'UfS 

0 0 .1 7 .45 IQl RQ 94 70 46 30 15 4 S 1 

Cla--'. Mark's (x) 

—7—6 — "5'-"4~""*3 ' 

-2-1 0 1 2 .3 4 5 6 7 


Tlie mean of thi.s distribution being 11.86, the origin has 
lieen chosen as at 12 ; p, therefore, is 7. 

Suppose we wish to find the curve 


'I'he values of the coefficients, ai , are 
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P 


M. ^Ly 

!■ -p 


, are computed and found to be 


M =^00, M - 146b, 

o ' j ' 

M.^-70, M -Zb, bb4, 

/ ^ 

M^-ZOdQ, 64,0/0, 


M^-bOZJba. 


From Table I, for p=7 , we find 

^ 0.28571 '42857. ^ .(2)50260 S5026. 

5^ = -0,15384 61538, - -.(2)10656 01066, 

= 0.043956 04396, = (4)35520 03352, 

. * -0.048840 04884, " -.(2)10360 01036, 

* (2)48840 04884, = .f3)997l9 21083, 

* 

- .(1)13986 01399, -.= -.(3)11182 23340. 

' =-.(2)84360 08436, = .(5)26311 13742. 

- .(3)44400 04440, 

( 

Substituting in the above equations, and 'puttini? p«7 in 

we have 
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<7> 

= i, 



= 


4/>c) 


• 4 ^ 

^■7, 


= 

JOx, 


= 




I4J. 9x, 


= Ai-*- 

42. 7 ^79x^-993. 77, 


le corresponding values, and collecting like 
have 

^6 - 69.80444 

a J9964Ai^^Q/dOP6x.^) (T^ . 

attention here to the fact, that if it is desired 
fi of the second, third, fourth, or fifth degree, 
be used, without change, using only as many 
egree required. Thus for the fourth degr^ 

2, rx) -f- a ^ • 

787 ^Ox - 8. Z742ix^^ 10. ^7877 x ’ 
+J.ZJ744x'*)C^.l 
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Sul )stitiitin,ij value's of fioin -/ to ^/in the above eiiuatiun, 
we obtain the folhming results: 


X 

y' 

(From Tabit 11) 

II 

yin 

ik'.in'.f 

ink'i!cr 

Observed 

Mnur 

-7 

,1,38,1,714 

,(4)610.1,1 1.16 • 

20 

0 

— 

ml 

-6 

258,77,1 

.(.1)85449 219 


(1 


H 

-5 

- 20,817 

, (2). 1,1, 141 992 

BH 

0 



-4 

42I,1'/J 

.(1)22216 797 

■ii 

9 


j 


71.1,4,14 

191 

4.1,19 

44 

,15 

! 9 

-2 


0 12219 2,18 


8(. 

101 

-1,1 

-1 

5,19.8,15 

0,18.128 857 



89 

1(1 

0 

415.2,14 




94 

"" 7 

1 





70 

,1 

2 

426.868 

0,12219 2.18 

.1216 

H9 

46 

6 

3 

417.224 

(1)61096 191 

25.49 

«26 

.10 

- 4 

4 


.(1)22216 797 

11.28 

11 

15 

-4 

S 

14,11.276 

,(2)5.1541 992 

7,9.1 

8 

4 

4 

6 

5016:640 

,(3)85449 219 

4.28 

4 

5 

- 1 

7 

14822.246 

.(4)610,15 156 

.90 

1 

1 

0 


The sum of the squares of the errors for this curve is 562, 
as compared with more than 1400 in Pearson’s first method, aiifl 
782 for his skew fre(iuency curve. 

The fourth degree curve, found by this method, gives 1170 
for the sum of the squares of the errors. 

’•‘Taken as 26 so that Ly = Ly^~?00, 
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TABLE I 


(Number in parenthesis indicates the luimbcr of ciphers between the 
decimal point and the first significant figure.) 


p 





P 

1 


2,000(K) IKXlOO 



1 

2 


1.66666 66667 

- 0.66666 66667 


2 

i 

0.66666 66667 

1.60000 00000 


026666 66667 

3 

4 


1.57142 85714 

- 0.28.571 42857 


4 

5 


1 55555 55556 



5 

d 

0.33333 33333 

1.54545 45455 



6 

7 

0.28571 42857 

1,5.3846 15.385 

- 0.15384 61538 


7 

8 

0 25000 OOlXlO 

1 53333 3.3333 

- ().1,3333 3,3333 

,(1)3.3333 3333.3 

8 

0 

022222 22222 

1.52941 17647 

- 01)764 70.588 


9 

10 

0.20000 OOO(K) 

1.526.31 578)5 



10 

11 

0,18181 81818 

1,52.380 952,30 

-(1)952.38 09524 


11 

12 

0.16666 66667 

1.52173 91.304 

-7(1)86956 52174 


12 

13 

0.15384 61538 


f -nirnm 


13 

14 

0,14285 71429 


n if liIlniriT 


14 

15 

0,13333 33333 




IS 

16 





16 

17 

0,11764 70588 

1.51515 15152 



17 

18 

O.lllll mil 

1.51428 57143 

1(1)57142 85714 


18 

10 

0,10526 31579 

1.51351 351.35 

1(1)54054 05405 


19 

20 

0 10000 00000 

1.51282 05128 

t(1)51282 05128 


20 


P 

V .. 



P 

2 

3.77777 77778 

- 1 mil 11111 

0.44444 44444 

2 

3 

2,08888 88889 

~ 0.35555 55556 

.(1)88888 88889 

3 

4 

1,46031 74602 

- 0.17460 31746 

.(1)31746 03175 

4 

5 

1.12592 59259 

- 0.10370 37037 

.(1)14814 81481 

5 

6' 

0.91717 17172 

-,(1)63686 86)869 

(2)80808 08081 

6 

7 

0,77411 47741 

-.(1)48840 04884 

.(2)48840 04884 

7 

8 

0.66984 12698 

-.(1)36507 93651 

.(2)31746 03175 

8 

9 

0,59041 39434 

-,(1)28322 44009 

.(2)21786 49238 

9' 

10 

0.52787 52437 

-,(1)22612 08577 

,(2)15594 54190 

10 

M 

0.47734 48773 

-,(1)18470 41847 

.j(2) 11544 01154 

11 

12 

0.43566 09574 

-.(1)15371 10232 

.(3)87834 87044 

12 

13 

0.40068 37607 

-.(1)12991 45299 

. .(3)68376 06838 

13 

14 

0 37091 30375 

-.(1)11124 67779 

.(3)54266 72093 

14 

IS 

0.34526 54625 

-.(2)96332 78599 

.(3)43787 62999 

15 

16 

0 32293 90681 

- (2)84229 39068 

,(3)35842 29391 


17 

0.30332 73916 

-.(2)74272 13310 

.(3)29708 85324 

17 

18 

0 28596 32742 

-.(2)65981 94834 

(3)24898 84843 

18 

19 

0 27048 10073 

-.(2)59006 37480 

(3)21073 70528 

19 

20 

0 2,5659 01934 

- (2)53081 42149 

(3)17993 70220 














2S0 


CURVE APPROXIMATION 


p 


4 



2 

2.66666 66667 

-3.33333 33333 

0 66666 66667 

2 

3 

2.20000 00000 

- 1.33333 33333 

0.13333 33333 

3 

4 

2.08571 42857 

- 0,85714 28571 

.(nS7142 85714 

4 

s 

2.03174 60317 

- 0 63492 63492 

,{1)31746 03175 

5 

6 

2.00000 OOOOO 

- 0,50505 05051 

.(1)20202 02020 

6 

7 

1.97902 09790 

- 0,41958 04196 

.(1)13986 01399 

7 

8 

1.96410 25641 

- 0.35897 43590 

.(1)10256 41025 

8 

9 

1,95294 1176S 

- 0,31372 54902 

.(2)78431 37255 

9 

10 

1.94427 24458 

- 0.27863 77709 

.(2)61919 50464 

10 

11 

1.93734 33584 

- 0.25062 65664 

,( 2)50125 31328 

11 

12 

1.93167 70186 

- 0.22774 32712 

,(2)41407 86749 

12 

13 

1.92695 65217 

- 0,20869 56522 

.(2)34782 60870 

13 

14 

1.92296 29630 

- 0.19259 2S926 

.{2)29629 62963 

14 

IS 

1.91954 02299 

- 0.17879 94891 

.(2)25542 78416 

IS 

16 

1.91657 39711 

- 0.16685 20578 

.(2)22246 94105 

16 

17 

1.91397 84946 

- 0.15640 27370 

.(2)19550 34213 

17 

18 

1.91168 83117 

- 0.14718 61472 

.(2)17316 01732 

18 

19 

1.90965 25096 

- 0.13899 61390 

,(2)15444 01544 

19 

20 

1.90783 09078 

- 0.13167 01317 

.(2)13860 01386 

20 


P 




P 

2 

7.77777 77778 

- 1.77777 77778 

0.44444 44444 

HQ 

1 

1.71851 8518S 

- 0.20740 74074 

.(1)29629 62963 


4 

0.77777 77778 

- .(1)63492 06349 

,(2)63492 06349 


5 

0.44656 08466 

- .(1)27513 22751 

.(2)21164 02116 


6 

0.29046 01571 

-.(1)14365 88103 

.(3)89786 75645 


1 

0.20424 02042 

- .(2)84360 08436 

.(3)44400 04440 


8 

0.15152 62515 

-.(2)53724 05372 

.(3)24420 02442 


9 

0.11692 06424 

- .(2)36310 82062 

,(3)14524 32824 


m 

.(1)92970 98957 

- .(2)25685 12785 

,(4)91732 59947 


11 

.(1)75704 41255 

- .(2)18834 96620 

.(4)60757 95549 

11 

12 

.(1)62843 75850 

- .(2)14220 88379 

.(4)41826 12878 

12 

13 

.(1)53006 31735 

- .(2)10999 62840 

.(4)29728 72538 

13 

14 

.(1)45312 71198 

- .(3)86826 75349 

.(4)21706 68837 

14 

IS 

.(1)39181 82002 

-.(3)69735 85518 

.(4)16217 64074 

15 

16, 

.(1)34217 03127 

-.(3)56853 29378 

.(4)12359 41169 

16 

17 

.(1)30140 11078 

- .(3)46959 15512 

.(5)95835 01045 

17 

18 

.(1)26751. 17185 

- .(3)39234 S4904 

.(5)75451 05S84 

18 

19 

.(1)23903 60285 

-.(3)33115 82259 

.(5)60210 58653 

19 

20 

.(1)21487 88465 

-.(3)28206 34400 

.(5)48631 62758 

20 

















2S1 


$ 

4 

5 

6 

7 

8 
9 

. 10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 


S.31SSS S55S6 
3.15206 34920 
2.28176 36680 

1.79717 I7I72 
1.48530 58058 
1.26686 60968 
1.10499 84438 
0,98009 86125 

0.88072 97705 
0.79975 28076 
0.73247 36327 
0.67567 56659 
0.62707 92809 

0.58502 29532 
0.54826 66002 
0,51586 56594 
0.48708 78486 
0.46135 66062 


3 


“ 2.31111 lini 

- 0.86984 12698 
~ 0,46490 29982 

~ 0,29090 90909 
“ 0.19962 2S996 

- 0.14562 47456 

- 0,11098 66169 
-.(1)87421 16730 

(1)70654 7S135 
.(1)58297 25830 
(1)48925 37414 
(1)41647 89943 
(1)35883 40140 

(1)31239 29587 
(1)27442 67469 
(1)24299 01253 
(1)21666 60247 
(1)19440 22295 


C, 


y. 




0,19555 SS556 
.(1)46349 20635 
.(1)18059 96473 

.(2)88888 88889 
.(2)50260 85026 
.(2)31176 23118 
.(2)20666 04420 
.(2)14402 01812 

.(2)10436 86658 
.(3)78047 SS631 
.(3)59889 86859 
.(3)46958 802S2 
.(3)37500 17543 

.(3)30421 80906 
.(3)25019 32673 
.(3)20824 49142 
.(3)17517 73888 
.(3)14875 87453 


3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 



1.27407 40741 
0.31746 0317S 
0.12768 95944 

.(1)64197 S3086 
.{1)36852 03685 
.(1)23117 62312 
.(1)15458 03507 
.(1)10847 37988 

.{2)79052 85098 
.(2)59393 10287 
.(2)45755 21097 
.(2)35996 92489 
.(2)28829 97519 

.(2)23447 56961 
.(2)19326 72711 
.(2)16118 23181 
.(2)13582 79996 
.(2)11552 71.331 


- 0,11851 85185 
-.(1)19047 61905 
-.(2)56437 38977 

-.(2)22446 68911 
-.(2)10656 01066 
-.(3)56980 05698 
-.(3)33198 46457 
-.(3)20639 83488 

-.(3)13501 76788 
-.(4)92017 48332 
-.(4)64862 67356 
-■(4)47031 15814 
-.(4)34930 30313 

-.(4)26484 45363 
-.(4)20444 80223 
■.(4)16033 34937 
.(4)12750 47715 
■■(4)10266 67694 


.(1)11851 85185 
.(2)12698 41270 
(3)28218 69488 

.(4)89786 75645 
.(4)35520 03SS2 
.(4)16280 01628 
.(5)82996 16142 
.(5)45866 29974 

.(5)27003 53577 
.(5)16730 451S2 
.{5)10810 44S59 
.(6)72355 62791 
.(6)49900 43304 

.(6)35312 60483 
.(6)25556 00279 
.(6)18862 76396 
,(6)14167 19683 
.(6)10807 02835 _ 


3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 
1 ^ 
18 

19 

20 
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CURFE APPJiOXIMATlOM 


p 

'^6 

^6 

^6 

P 

3 

31.86666 66667 

- 2.95555 55556 

1.24444 44444 

3 

4 

9.86349 20646 

- 1.68888 88889 

0.31111 mil 

4 

S 

6.24338 62423 

~ 1,20740 74074 

0.14814 81481 

5 

6 

4,87864 28716 

“ 0.94276 09428 

.(087542 08754 

6 

7 

4.17145 81715 

- 0.77415 17742 

.(1)58016 05802 

7 

8 

3.7S27S 83528 

- 0.65703 18570 

.(041336 44134 

8 

9 

3.47631 97S77 

- 0.57083 96179 

.(030970 33685 

9 

10 

3,28104 57516 

- 0.50471 27623 

.(1)24079 80736 


11 

3.13617 37676 

- 0,45235 63811 

.(019263 84589 

11 

12 

3,02462 67843 

- 0.40986 52428 

,(1)15764 04780 

12 

13 

2.93620 42788 

- 0.37468 59903 

.(1)13140 09662 

13 

14 

2,86445 51800 

- 0.34507 78315 

.(1)11121 84648 

14 

IS 

2.80510 85568 

- 0.31981 26863 

.(2)95.359 72754 

IS 

16 

2,75522 87178 

- 0.29799 91485 

.(2)82670 73153 

16 

17 

2,71273 35648 

- 0,27897 43935 

.(2)72358 73754 

17 

18 

2.67610 66890 

- 0,26223 52558 

.(2)63864 45096 

18 

19 

2,64421 82442 

- 0.24739 28474 

.(2)56784 05678 

19 

20 

2,61620 96162 

- 0.23414 18341 

.(2)50820 05082 

20 


P 

^<5 



P 

3 

~ 0.88888 88889 

10.85234 56791 

- 3,56543 20985 

3 

4 

-.(1)12698 41270 

28.89171 07594 

- S.S167S 48502 

4 

5 

-.(2)42328 04233 

1.42790 35865 

- 0.19461 49324 

5 

6 

-.(2)19240 01924 

0,86407 78152 

-.(1)91881 78077 

6 

7 

-,(2)10360 01036 

0,58217 60133 

- ,(1)50816 67304 

7 

8 

~ .(3)62160 06216 

0.41986 26065 

-,(1)31099 76443 

8 

• 9 

-.(3)40221 21669 

0,31750 862SS 

-.(1)20431 95246 

'9 

10 

-.(3)27519 77984 

0,24868 72810 

-.(1)14149 24384 

10 

U 

-.(3)19656 98560 

0,20013 83997 

- .(1)10205 88045 

11 

12 

-.(3)14529 07631 

0,16458 67560 

-.(2)76047 24004 

12 

13 

-.(3)11042 09800 

0,13776 06312 

- ,(2)58190 22630 

13 

14 

-.(4)85882 98444 

0.11701 40143 

- .(2)45522 17628 

14 

IS 

-.(4)68U4 09110 

0,10063 52146 

- .(2)36284 76722 

15 


-.(4)54930 71863 

,(1)87476 39340 

- .(2)29389 67830 

16 

17’ 

-.(4)44943 31524 

.(1)76744 10943 

- .(2)24138 37900 

17 

18 

- .('^)37238 74691 

.(1)67875 46964 

--.(2)20068,35825 

18 

19 

-.(4)31200 03120 

.(1)60462 05713 

-.(2)16865 01032 

19 

20 

-,(4)26400 02640 , 

.(1)54201 64924 

-.(2)14309 22600 

2a 











H. B. H. armiNLBAP 


25.'! 


p 


H. 1 


P 

3 

(1.26864 19754 

1 26419 75310 

-,(1)98765 43210 

i 

4 

0.24409 17107 

1.19223 98589 

-,(1)56437 38977 

4 

5 

,(2)60764 25632 

(1)30570 25279 

-,(2)10346 85479 

' 5 

6 

.(2)22147 39993 

.(1)11372 98915 

- (3)29928 91882 

6 

7 

.{.1)99719 21083 

.(2)51964 49641 

- (3)11182 23.340 

7 

8 

.(3)51454 71812 

.(2)27108 69377 

-,(4)49333 .18267 

8 

9 

.(3)29218 90503 

.(2)15524 5.1840 

- ,(4)24473 22709 

9 

10 

.(3)17816 .50932 

.(3)95299 97834 

-,(4)13250 26436 

10 

11 

.(3)11479 15011 

,(3)617.17 49551 

-.(5)76774 75857 

11 

12 

,(4)77282 945,12 

.(3)41752 74961 

-,(5)46962 67091 

. 12 

13 

,(4)53932 11190 

,(3)29248 26113 

-,(5)30029 01554 

13 

14 j 

,(4)38778 42203 

,(3)21098 48164 

-,(5)19924 01348 

14 

15 

.(4)28596 27483 

.(3)15602 20206 

-,(.5)1.16.19 45170 

15 

16 

,(4)21549 40811 

.(3)11786 12081 

- (6)9.5910 77856 

16 

17 

.(4)16546 77743 

,{4)90694 43517 

-, (6)690.10- 58224 

17 

18 

,(4)12915 92277 

,(4)70928 04898 

-,(6)50706 35472 

18 

19 

,(4)10229 00232 

.(4)56268 09892 

-,(6)37922 29454 

19 

20 

,(5)82061 36861 

,(4)45209 40193 

-.(6)28818 74227 

20 


P 


3 

(2)79012 34568 

4 

.(2)28218 69488 

■•s 

.(4)37624 92651 

6 

.(S)8SS11 19662 

7 

.(S)2f.311 13742 

8 

.(6)98()6)6 765.33 

9 

.(6)42562 13407 

10 

.(6)20385 02210 

11 

,(6)10589 62187 

12 

(7)58703 33864 

13 

.(7)34318 87490 

14 

.(7)20972 64577 

IS 

.(7)13306 78214 

16 

,(8)87191 61687 

17 

.'(8)58749 43169 

18 

,(8)40365 08.578 

19 

(8)28()20 59966 

20 

.(8)20584 81591 
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CURl^n APPROXIMATION 


TABLE II 


rzpj/ 




-K 




(The number in parenthesis indicates the number of ciphers between 
the decimal point and the first sisnificant figure.) 


X 




P‘^ 

p^b 

0 

0,31250 00000 

0.27343 75000 

0.24609 37500 

022558 59375 

1 

0.23437 50000 

0.21875 00000 

0.20507 81250 

0.19335 93750 

2 

0.0937S 00000 

0.10937 SOOOO 

0.11718 75.000 

0.12084 96094 

3 

' 0.01562 50000 

0.03125 00000 

0,04394 53125 

0.05371 09375 

4 


0.00390 62500 

.(2)97656 25000 

.(1)16113 28125 

5 



.(3)97656 25000 

.(2)29296 87500 

fi 




.13)24414 06250 


X 

P--7 

P«(9 

- 

p^/0 

0 

0.20947 26563 

0.19638 06152 

0.18547 05810 

0.17619 70520 

1 

0.18328 85742 

0.17456 05469 

0.16692 3S229 

0.16017 91382 

2 

0.12219 23828 

0.12219 23828 

0.12139 89258 

0.12013 43536 

3 

.(1)61096 19141 

.(1)66650 39062 

.(1)70816 04003 

.(1)73928 83301 

4 

.(1)22216 79688 

.(1)27770 99609 

.(I).32684 32617 

.(1)36964 41650 

5 

.(2)55541 99219 

.(2)85449 21875 

.(1)11672 97363 

.(1)14785 76660 

6 

.(3)85449 21875 

,(2)18310 54687 

,(2)31127 92968 

.(2)46205 52063 

7 

.(4)61035 15625 

,(3)24414 06250 

.(3)58364 86816 

,.(2)10871 88721 

8 

9 

10 


.(4)15258 78906 

.(4)68664 55078 
.(5)38146 97265 

.(3)18119 81201 
.(4) 19073' 48633 
.(6)9.5367 43164 



/?* // 

p./Z 

P^O 


0 

0,16818 80951 

0.16118 02578 

0.1S498 10171 

0.14944 59808 

1 

0.15417 24205 

0.14878 17764 

0,14391 0944S 

0.13948 29154 

2 

0,11859 41696 

0.11689 99672 

0,11512 87SS6 

0.11332 98688 

3 

.(1)76239 10904 

.(1)77933 31146 

.(1)79151 01945 

.(1)79997 SS442 

4 

.(1)40660 85815 

.(1)43837 48770 

.(1)46559 42321 

.(1)48887 39437 

5 

.(1)17789 12544 

.(1)20629 40S98 

.(1)23279 71160 

.(1)25730 20756 

6 

.(2)62785 14862 

.(2)80225 46768 

,(2)98019 83833 

.(1)11578 59340 

7 

.(2)17440 31906 

.(2)25334 35822 

.(2)34306 94342 

.(2)44108. 92725 

8 

,(3)36716 46118 

.(3)63335 89554 

.(3)98019 83833 

,(2)14034 6S867 

9 

,(4)55074 69177 

.(3)12063 98010 

.(3)22277 23598 

,(3)36612 15305 

10 

.{5)52452 08740 

.(4)16450 88196 

.(4)38743 01910 

,(4)76275 31886 

11 

,(6)23841 85791 

.(5)14305 11475 

.(5)48428 77388 

.(4)12204 0S1O2 

12 

13 

14 


.(7)59604 64477 

.(6)38743 01910 
.(7)14901 16119 

.(5)14081 S9733 
,(6)10430 81284 
,(8)37252 90298 
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% 

p-/^ 

p = 16 

P-U 

0 

(1.1444(1 44481 

0.13004 0<)341 

0,13583 37506 

] 

0.13543 54201 

0.13171 75850 

0 12828 74306 

2 

0 11153 50518 

0.10076 46542 

0,10803 15281 

3 

.(1)80553 00290 

.(1)80879 21888 

(1)81023 6460S 

4 

(1)50875 637G8 

.(1)52571 49227 

.(1)54015 76403 

5 

.(1)27081 60072 

(1)3(HM0 85273 

(1)31019 40602 

G 

(1)13324 57177 

,(1)15020 42636 

(1)16653 08140 

7 

,(2)54500 61180 

.(2)65306 20158 

(2)76326 62309 

8 

(2)18050 8(i407 

(2)24480 82559 

(2) .31)530 64024 

9 

.(3)55200 60617 

(3)78367 44180 

(2)l()5f)« 30166 

10 

(3)13271 00548 

.(3)21008 02660 

.(3)31313 48640 

n 

(4)25522 80516 

(4)46886 50370 

(4)78283 71.500 

12 

(5)37811 60653 

.(5)83725 89946 

.(4)16106 63089 

13 

(6)40512 53200 

.(5)11548 30092 

,(5)26004 38482 

14 

,(7)27039 67724 

(6)11548 30092 

.(0)34831 46429 

IS 

,(0)03132 25746 

(8)74505 80597 

,(7)32654 40777 

IG 


(0)2.3283 064.36 

.(8)1070(1 60471 

17 



.(10)182(17 66(101 



p=/d 

P^I9 

P^zo 

0 

0,13206 1)5006 

0 128.58 ,S32(r) 

0.12537 (16876 

1 

0.12511 00417 

0.122)5 60546 

0,11040 06540 

2 

0.10634 35354 

0.10470 51897 

0.10311 87474 

3 

,(1)81023 64606 

(1)80908 SSS6S 

.(1)80701 62830 

4 

,(1)55243 30S04 

.(1)56284 21262 

(1)57163 6S34S 

S 

.(1)33626 41437 

.(1)35177 63289 

(]).36S84 73821 

6 

.(1)18214 30779 

.(1)19699 47442 

(1)21106 57973 

7 

(2)87428 67737 

.(2)98497 37209 

.(1)10044 15245 

8 

,(2)36080 05581 

.(2)43776 60982 

(2)50812 13640 

0 

.(2)13600 6S030 

.(2)17197 95385 

.(2)21025 71101 

10 

.(3)44034 50025 

■ .(3)59303 28916 

.(3)77004 27501 

11 

‘.(3)12147 47317 

.(3)17790 98'675 

.(3)24869 12126 

12 

.(4)28344 10407 

,(4)45912 22386 

.(4)00044 40355 

13 

(5)54850 SS620 

.(4)10043 29897 

,(4)16056 21904 

14 

,(6)85718 05666 

.(5)18260 54358 

.(5)34900 86273 

IS 

.(6)10300 0f)747 

(6)26853 74056 

,(6)50845 47897 

16 

(8)01677 06SOS 

(7)30689 98921 

.(7)83118 72080 

17 

(0)52386 80483 

,{8)25574 99101 

.(8)89858 07654 

18 

10 

20 

(10J145S1 91523 

1 

.(9)13824 31946 
(11)36379 78808 

.(9)70940 58673 
.(10)36379 78807 
(12)90949 47018 
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APPROXIMATION AND GRADUATION ACCORD- 
ING TO THE PRINCIPLE OF LEAST SQUARES 
BY ORTHOGONAL POLYNOMIALS’-" 

/?v 

riMijiis fmirKN', fh Si I nmmtv nf 

PREFACE 

In the present paper the inatliematical ilicnr) of approximation 
by orthogQn6al polynomials is given in its entirety and accompanied 
by nearly all of the necessary demonstrations This has necessitated 
some mathematical preliminaries 

Statisticians less mtciested in mathematics uill find it sufficient 
to read § 12 on the iccapitiil.itioii of the njierations, the Ucgmning 
of § 9 concerning the computation of the mean binomial moments 
and the mean orthogonal moinents, tj 11 dealing with the method 
of addition of ilillcieiiu's. and tlie ex.iniples of ijld In para- 
graphs § 14 and § U dealing with toi relation and in § 17 tieating 
graduation, one mat observe the cnd-foinnilae and skip the lest 
With the aid of these sections and the tallies, one may leadily 
employ the mctlmds and attain lesiills with a lers small amount 
ol labor 

/.//>■/./; OF CO\"//:\l.S. 

§ 1 Introduction 

§ 2 Sonic toinnilae of the Calculus of Finite Diffeieiices 
§ 3. Changing the oiigin 
§ 4, Changing the interval h 
§ 5. The problem of appioximation 
§ 6. The deduction ot the polynomial 
§ 7. Determination of the coefficient 

§ 8. Determination of the measure of obtained approximation 

♦Paper read by Professor Harry C. Carver before the 93rd Annual 
Meeting of the American Statistical Association held in Washington, D C, 
December 28th, 1931 
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§ 9. Determination of the binomial moments. 

§ 10. Transformation of the orthogonal series in Newton's 
expansion. 

§ 11. Method of the addition of differences. 

§ 12. Recapitulation of the operations, . 

§ 13. Examples. 

§ 14, First problem of correlation, 

§ IS, Second problem of correlation, 

§ 16. Some mathematical properties of orthogonal polynomials. 

§ 17. Graduation by orthogonal polynomials. 

Bibliographiccil and Hixtoriral Nntrs, 

§ 18. Chebisheff. 

§ 19, Gram. 

§ 20. Jordan. 

§21. Essher. 

§ 22. Lorentz. 

§ 1. Introduction, It ha.s been shown that in any ca.se of 
approximating a function it is advantageous to develop that 
function in a series of orthogonal functions. It was Fourier who 
first used such an expansion in his treatment of trigonometric 
series. Thevfirst expansion in orthogonal polynomials was per- 
formed by Legendre. In Legendre’s polynomials the variable 
is a continuous one, it takes on every value between ~1 and , 
Orthogonal polynomials with respect to a discontinuous variable, 
where X assumes only the N values 

deduced by Chebisheff'- who has treated the particular case of two 
orthogonal polynomials with respect to eciuidistant variables. 

Since then several authors have investigated this subject. Poin- 

“•Chebisheff. Sur les fractions continues, Journal dc Mathimatiques pure.s 
and appliquees 1858, T. Ill (Oeuvres Tome I, 203). 

Sur I’interpolation par la m6thode des moindres carris, Mem. Acad, Imp. 
de §t. P^tersbourg, 1859 (Oeuvres Tome I, p. 473). 

Sur I’interpolation des valeurs iquidistantes, 1875 (Oeuvres Tome II. p. 
219). 
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care- and QuiqiieH considered orthogonal polynomials of a discon- 
tinuous ■variable in the case of non-equidistant values. Grom' 
employed these polynomials for ■■,-1^0,1, ■■■, n, Jordan'' 

considered the general case of equidistant orthogonal polynomials 
for equidistant values of between a and b , the interval of two 
consecutive value.s of as being h . moreover he treated the jiartic- 
ular ca.se of polynomials relative to xi ^ O, which 

Chebisheff has examined in another way Esslier in his fir.st ixib- 

lication" used such polynomials with respect to 

and in his second^ for n . Lorentz" also iiitui- 

duced orthogonal polynomials for x = and for 

X> = ~2n+1, • • -1^1, 3, ■ ■ ■ ^ ^ TJ-J, the interval in this latter 

case being obviously equal to two 
In later publications I showed new methods for using orthog- 
onal polynomials for approximation” and graduation'® which per- 
mit the results to be reached very rapidly. In the present paper the 
general case of orthogonal polynomials for equidistant values of 
the variable is to be discussed; the formulae given are valid for 
all orthogonal polynomials of equidistant values such as the poly- 
nomials of Gram, Essher and Lorentz. These are al.so discussed 
in this paper. At the end some very useful tables are aiiiiendecl. 


- Powcairi,Ca.\cu\ des probability's, Pans, 18%. p 251^ 

" A. Quiquet, Sur une niethode d'iiiterpolatioii exposce par Henri Poin- 
care. Pioc ot the fifth International Congress of Mathematicians. Cam- 
bridge, 1913 p. 385, 

■*/. Gram. Ueber partielle Ausgleichnng mittelst Onhogonalfuiiktionen, 
Bull, de I'Association des Actuaires Suisses, 1915. 

’>Ch. Jordan. Sur mie scrie de Polynomes doiit chaque somme partielle 
represente la meilleure approximation d’un degre donne suivant la methode 
des moindres carres Proc. of the London Math. Soc. 1921. 

"F. Essher. Ueber die Sterblichkeit in Schweden, Lund 1920 
'F Essher On some methods of Interpolation, Scandia, 1930. 

"P. Loientz Der Trend, Vierteljahreshefte zur Konjuiikturforschung, Ber- 
lin 1928. Zweite Auflage, 1931. . . , , ■ 

"K. Jordan, Eerechnung der Trendlinie auf Grund der Theorie der kleiti- 
sten Quadrate, Mitt, der Ungarischen Landeskommissi’on fur Wirtschafts- 
statistik und Konjuncturenforschunfe 1930. 

Ch. Jordan, Sur la ddtermination de la tendance seculare des gran^urs 
statistiques par la methode des moindres carres. Journal de la Societe Hon- 
groise de Statistique, 1929. , 

'"Ch Jo) dan, Statistique Mathematique, p. 291, Pans 1927, Gauthier 
Villars. 
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§ 2, Some formulae of the Calculm of Finite Differences. 
Since the functions considered in this paper correspond to ;»s- = a, 
a-th, a-t-Zh^a-i-^h-s the increment/? being constant, the Calculus 
of Finite Diffeiences will he founrl very useful, 

By the first difference o\ Ffx)\va mean FfxYh)- F(Ai) which 
is denoted \iy /!XF(>cX so that AFfx)=- Ffx-i-h) ~FCx). 

The 77-//? dilfeience of will he defined as 

A^F(x)^A\A^'^F(ui)\. 


We shall term Ffx.) the indefinite .\um of ffZf'dnd denote it by 
Zi ff^Xil AF/^) = It follows that E fM’^FFxJ-YC, 
whole C is an ai Inti ary constant oi a ]?enodic function of 
periodicity h . 

The n-ifiMm of Ff’MfwnW be defined by 

This contains an aibitiary polynomial of the degree, con- 
sideiing only the iiolynomial and neglecting the arbitrary periodic 
function. 

It would be moie precise to add the iiicrenient /? to the above 

notation for the difference A and the indefinite sum Zf ■ Thus 

we could use A and X , resjrectivcly ; but since in our formulae 
, A? 

the increment will generally equal n we shall omit this index 
except in cases where doubt might otherwise arise. 

By the definite sum of f{x) between a. and b , the following 
sum is understood fb being equal to a + nb and r? an integer) 

9 

and is denoted by ^ 

E f(sc). 
x.=a 

It can be shown that if Ffc) is the definite sum of i0c)> the above 
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sum is equal to the dif¥erencc of the values of F 6c) taken at the 
limits, so that we have 

F ffx,) = ■■i-f6i>-h)=Fri>)-F6A). 

According to our definition it is evident that the value of the 
function fC.c) at the upper limit, i e. ffb) , is not included in the 
sum between a and b This terminology, although rather unusual, 
will be adhered to throughout this pajKr. 

We shall have occasion to emiiloy the followdng formulae of 
the calculus of finite differences, 

Ponnula of differencing by /'ails, or of a product' 

( 1 ) Zi = Upx) -d vCx-hh) ■ A uCx) 

Formula oi the n-th diflcicncc of u pioduit' 

( 2 ) ( s ) A uFx.r-sh)-Af r'ffx) . 

Ponnula of the sununation by parts, or of llu’ sum of n product: 

Z\u('xj y(x)-2) \Au(rc)-Zy6xc-i-hj\. 

Ponnula of the sum of a produil, u{x) being a polynoinud of 
the n~th degiee. 

2j[uiCc)-ypxj\ = u{x) FyC'^)-AuCxPZ,’y6ci-h) 

( 3 ) 

tAa(x)- 76^vPec+Zh)- • r?b). 

In the second member of this eciuation Zj yO^) contains one 
arbitrary constant, Zli/6k’^/f?,i)contains, besides this, one mure, and 

so on. Ultimately the second member will contain rr-f-I arbitrary 
constants — - but since the first member contains but one constant 
it is evident that after simplification all of the terms of the arbi- 
trary polynomial must necessarily vanish except the single constant 

term arising from F 

Generalised binomial coefficients. We shall denote the general- 
ized binomial coefficient of the degree by 

/;e I x('jc-h)-6»-^h)- ■ -(x-nh-eh) 

^ r7/fy- I . z ■ 3 • 


77 
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where the index /h is associated with the decrement. If h^I the 
index will be omitted, and the expression above will be equal to 
the ordinary binomial coefficient, . 

Let us mention the following well-known formulae : 




Expansion of a function in a series of generalised binomial 
coefficients: 


( 4 ) 






The generalized binomial coefficient can be expressed as an 
ordinary one by merely changing the variable, Thus if we place 
(x-^)/h = ^ we have that 



and consequently if we write ‘=-f{'s-hh'^) it follows 

that FfO)=A^ ffri) , and formula (4) may be written 


(4') 


F^) = FfoMff)- AFfOj^f^jA^FroJp 


Formulae (4) and (4') are two different forms of Newton’s 
series. The great importance of Newton’s formula to the statis- 
tician is not yet sufficiently recognized by the latter, since he 
nearly always develops his functions in power series in spite of 
the fact that he is generally primarily concerned with the differ- 
ences and the sums of his function. Now if a function be expanded 
in a Newton series as in (4) above, its m-ih difference and its 
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sum can be obtained immediately by means of the 
formulae : 


( 5 ) 








w/i , , 
Ai fCdJ 



fj n-rrf ' 


following 


(6) 






-~7p~ + 



Aimi 


These operations would be very complicated if f^^^were 
expanded into a power series. Although it is true that a power 
series would be more advantageous for determining either the 
derivatives or the integral of fCx), we may remark that the 
statistician hardly ever needs these quantities. And for nearly 
all other operations, Newton's formula is at least as convenient 
as an expansion in a power series. 

To illustrate the last remark — if corresponding to a 

given value of x is needed, then in the case of a power series 
it is necessary to compute the values of • • • ■ and the.se 

are obtained most readily by means of successive multiplication. 
In the case of a Newton series it is necessary to calculate 
( 3 i-a),(ifA)(>i-a-h),(:>i-a)(ji-a-h)(ji:-( 3 - 2 h), and these 


should also be obtained by successive multiplication. The formula 
for changing the origin and that for changing the interval of 
observation are given in the following paragraph and are as simple 
as those arising in the case of power series. If a statistician 
expands a function into a power series and needing the differences 
of the function for ;K«ot calculates them separately, he doubles 
his work since these differences are precisely the coefficients in 
Newton’s formula. In statistical research Newton’s series should 
always be preferred to the power series, 
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§ 3. Chunyint/ the nriejin In mathematical statistics it fre- 
(luenlly ncciiis tliat it is neicssaiy to chani^c the ori,c;in of a set 
of obsei vatioiis. For instanci,, if fC‘ 3 )\s the value of some quantity 
correspondins; to the first ot January of the year I9OI 
a =J<i 7 iuckry 190 J , 

h icpreseiits the iiitoiv.il of a year, and it follows th.il x 
represents January where ^ an inteqei If we 

know the Newton e.vpansion of f(J) m ,c^cneiah/ed limomial 


Coellli’iciUs 


lat IS 




, /x-a 1 f{'d ) 


and desiie the values of ffx:) counted fioni the iirsl of July 
of 1901 . then denoting 

C = July 1, 1901 

we need fl'x) e\\ia.w\e<.\ into a .senes of geneiah/ed hniomial co- 


efficients 


c 


'X-C 

J 


h 


, that is 


The coefficients of thn e\pati'-ion iiiiist he .so determined that 
X~c+^hw\\\ corre'-pond to July l^fl 991 -^^) Jhei'e are easily 
obtained by putting x-c 111 the fir'-t equation of thi^. pata.giaph, 




rJ 


A^fTaJ 

h ^ 


and also placing x = c in thes-zf/? difference of the .same eiiiiation, 
so that we have 


(7) ■■■■ JJl 


rj-S 
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Substituting these values into equation (oc) yields the required 
expansion. 

Remark. If c-a=: mh where m is an integer, from the above 
equations it follows that 

f(c)= f ( f {dr-wh). 

An example of this is given in § 13. 

§ 4. Changing the intcmil h . Sometimes a changing of the 
interval is needed in Newton’s formula , this occurs for instance 
in statistics when a function fCxt), giving the value of some 
quantity corresponding to the middle of the year X' . i,s known 
for several consecutive years = ./^ by its Newton expansion 
(4), and it is nece.ssary to obtain the values of the quantity corre- 
sponding to the first of each month. It would of course be possible 
to calculate these values by placing » ~ i/lZ, Z/IZ^ Z/lZ, • • • in 
Newton’s formula, but it is more advantageous to change the 
interval in formula (4) and deduce another one in which 

both the increment of the differences and the decrement of the 
generalized binomial coefficients are k^^l/JZ. The formula thus 
obtained leads, by the method of summation of the differences 
(§11), more rapidly to the results So, starting from formula 

(8) ^ ^ 

we have to deduce 

(9) 

To obtain this, it is sufficient to know that for ^=ot the differ- 
ences of the generalized binomial coefficients ^ system 
of differences in which the increment is k may be written 
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( 10 ) 



To obtain these numbers, A ^,let us write the following identity, 


f' X'S. \ x~A'mh + h 

* I "’4 i \l-’-4 ' — — . 

and then deduce the s-th difference. By formula (2) we have 


f Y - 


K-a^mh*h 
m 


'h ^ H 

Placing in this equation 4 , we obtain 




m ^ 



( 11 ) 




The complete solution of this Eqmtion of Partial Differences 
with the interval k would be very complicated, but one may 
readily solve it for some particular values of 3 and then deduce 
successively the other values oi A ^ starting from the initial 
values which follow immediately from (10). These are that 

^rrt'O, except that 
and 


Equation (11) can be solved first for a^rrr yielding 
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and secondly for s =/ from which we obtain 


/ k-Ccr?-lZ)h , 

I = — • 


SO that by multiplying we easily obtain 


/l' 


and thirdly for s^m-Iwe. may express successively the values of 


.77}-! . m-z 


♦»)« \/-/ 


and then multiply each/) by /c adding the 


products obtain the result 


m-t rn-Z /k ) 

-(m-l}k -(zj. 


The other values of . are obtained as indicated above. The 
w 

following table contains the numbers necessary for binomial co- 
efficients up to the fifth degree. 


A-* 




A‘.M^ 


A‘.Zk(i)^ Al.k^ 

KOk A!,.sk‘(S), 


K’CJk Ag‘i(Sk-7h)(ij, 


4‘- 


Af.k^ 
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s 

The numbers being known, we can immediately express 

in a Nmton series, with the increment equal to k, 


It follows from (8) that 


rj>./ ^ m-i-/ ^ O 

fM.z .4 


maO ^ 


V- 


and consequently the s-ih difference of 'f(^) for x=!di will be 

S / s Tn 


niss 


m 


When these values are placed in equation (9) the problem is 
solved. An example is given in § 13. 

§ S. The problem of approximation. The number of the given 
values will always be denoted by /V in this paper. The values 
Ao) yi-‘yz>" V yrt-i correspond to x^,<i+h, a-t-Zh,”-, b-h where 
b = d-f-rih. A parabola of the v-ih degree, is to be 

•determined according to the principle of least squares, that is, so 
that the sum of the squares of the deviations for 

=Oj 1^2, ••■j TY-l shall be a minimum. Hence the parameters 
must be so determined that the expression 


( 12 ) 


b 


\f„M-yY 


shall be a minimum. 
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To solve the problem in the ordinary way would require the 
solution of determinants of the yi-ih order. 'I'his would 
be very laborious, as those who have employed Gauss’s method to 
solve this problem know. It is fai more convenient to first expand 
the function f(x) into a .seiics of orlhocjoiuil [yolynoutmls 
Let U^‘=U^(x) be such a polynomial — it is termed orthogonal 
if it satisfies the following i elation 

for all values of \J diti’erent from p , . The expansion of ff^(x) 
can be written as follows 

(14) ^ 

where the are constant parameters which must he evaluated 
according to the principle of least squares. 

To render expression (12) a minimum it is necessary that the 
first derivative of '5 with respect to a should vanish for all 
values of \J . This will ptoduce 77 // equations which determine 
the 77 v-y parameters, namely 

X=<3. L -I 

As a consequence of relation (13) these equations are so simplified 
that we may write 

fi5) v^-y-O. 
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The second condition of a minimum, namely that the expression 


n+l n* ! 

z: L 

■0=0 M'-O 






.shall !)e a [jositivc definite form for all values of c2'£^^^and da^, 
is also .sattsfied since in consequence of (13) this quantity is 
equal to 



From (15) it may be concluded that the coefficients are 
independent of the decree v of the parabola of approximation. 
Consequently, if the coefficients a,, corresponding 

to a parabola of degree 77 have been calculated, then to obtain a 
further parabola of degree yjr-J it is sufficient to determine only 
one further coefficient, — the others will remain unchanged. 
This is of great impoitance. If the series (14) is limited at any 
term, the remaining expression will always satisfy condition (12). 

§ 6, Dednehon of the polynomial Uj. Instead of starting from 
relation (13) we shall employ the following equivalent formula, 

where (x) is an arbitrary polynomial of degree 777-/. If 

we were to expand Cx,) into a series of polynomials 

we would return back to condition (13) again. 

Applying formula (3), of the sum of a product, to the above 
expression yields 

dci, h^2h)- ■ ■ ■ 
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Now, X,U^(k) contains an arbitrary constant to which may be 
assigned such a value that ZlU^(d)=0. But jfv^Cx+h) contains 

an additional constant which can be chosen so thatZlC^l^r-4^d?. 

Continuing after this fashion we may dispose of all these arbitrary 
constants in such a way that the expression for the definite sum 
will vanish for the lower limit that is 


F rU 

m-l m 


= o . 


But in order that the definite sum may be equal to zero it is 
necessary for the above expression to vanish also for the upper 
limit, x- = h ■ But since is arbitrary for all values of s it 
follows each expression obtained for 0,1,2, ^w-^must vanish 
separately for x^b . From this we conclude that and 

(x-b)taxy%\ both be multiplying factors of l/fx-) , Considering 
for the moment only the first of these factors we may therefore 
write 

U(x.)= ( X-<i)ct)(x). 


Applying to this expression the formula for the sum of a 
product, (3), we have 

By successive summation we should find (x-a)(x-<x'h) 
(x-a -rnh-f-h ^ is a multiplying factor of 23 '^^(x) and that we 
can assign the following form to this expression 


( 16 ) 
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As (x,-t?) must also be a multiplying factor of 
same reasoning leads to the expression of U 




As this sum must be oi degree it follows that C is an 
arbitrary constant and we conclude that the general formula of 
the orthogonal polynomials with respect to is 

the following 


(17) U„M. Ca ■ 

Starting from this expression, there are two different ways of 

deducing the expansion of in N ewton-stnts, as has been 

shown in the paper “ . First, we can utilize formula (2), giving 
the m-ih difference of a product, and obtain 

(18) uj^j.ch 

Secondly, we can develop into a iV^ewton-series of general- 
ized binomial coefficients ^ . According to formula (5), we 
have then that 
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The 3-tltdiflcience in this formula can be written according to 
(2) in the following manner 






bu it follows that 


( 20 ) 


^ -i? /77r /' 


,s\/b d? 7* ynh 


cU'd finally pulling ni-f-v' into this expression, and determining 
the w-ih dilTereuee, we see that 

, 71 1 u„ -ch 

Let us note that - C. 

As Z! symmetric with respect to s and b , we can 

get two other formulae for irom (18) and (21) changing 

a into ^ and inversely. For instance, remarking that b-dsNh^ 
and that 


it follows from (21) that 
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The constant C is arbitrary, whatever value may be chosen 
for it. The orthogonal polynomials introduced by different inves- 
tigators differ only in the value attributed to C . As these poly- 
nomials are closely related to Legendre’s polynomials it seems 
most advisable to choose C in such a manner that for the 

limit of the polynomial l/^ shall be equal to Legendre's poly- 
nomial ^ . For this purpose, we must put into (19) and (21) 
and, b^l and, C=1 then, deducing the 

limit for h=0 , we obtain two known formulae for Legendre's 
polynomials.'^ 

The choice of tT is only important if we want to compute 
numerical values of corresponding to any value oi^ ; in 

this case C should be chosen so that the calculation of U^(k) 
shall be as short as possible. As we shall see later Essher in his 
first paper and also Gram proceeded in this manner. The above- 
given value that I chose for C is m this respect also very accept- 
able; and whenever numerical values of are needed, we will 
adopt this value, It will be shown that our problem can be solved 
in a general way by leaving the constant C arbitrary. 

§7. Determination of the coefficients are given by 

formula (15) ; but first it is necessary to know the value of 




To determine this quantity we shall apply formula (3) of the 
sum of a product to the following indefinite sum 

( 22 ) . . 

^f-1) 


proceeded ia this way in the paper ® he, cit,; formulae (19) and 
(21) of the present paper are identical with (9) and (13) of the first paper, 
onlj the notation is somewhat different, 
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Let us now deteimine the cjuantities they 

are easily obtained, if ju <777//, by deducing the ^./////^differ- 
ence of 23 U^(x-^Aih-h).'^\,zxi\\\o from formula ( 20 ), after having 
replaced /rr by it follows that 

( 23 ) 

r / ^+M^-h-b\/'5 \Ab-&+mh\ 

tj s-m,ju)(rn)l ^rr,-S 

At the upper limit, x=.b the first generalized binomial coeffi- 
cient figuring in this formula can be expressed by an ordinary one, 

(Mh-h \ 

but since ^5 - w, it follows that this expression is equal to zero ; 
indeed an ordinary binomial coefficient ( ’f) is equal to zero, if r 
and / are integers, and if r< 

H^U^fxfMb-h) being symmetrical with respect to 3 and 

b , its(w-///,^'difference 27 '^Z^/%z//^./f>^^ be symmetrical too; 

but this quantity, as we have seen, is equal to zero for , 
therefore it must also be equal to zero for x=a. . 

We conclude that at both limits and all the terms 
of ( 22 ) will vanish in which — that is all terms except 
the last. 

To evaluate this last term, we shall determine first the indefinite 
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sum of From (20) we easily get, after putting ?c+mh 

instead of ?i, 


L 




f^2TT)*l 

C. 

h ^ 

'' Ja777 


,x:^mh-b) /B 
( 6-f-J T.i. w/( 


ti-afTnh\ 
I?m-s h • 


Since mKs+J , this expression will vanish at the upper limit 
and its value at the lower limit, , will be 


2: UJa-^wh)^iZ fmX 


^Zm-hl 

h 


S-t>-hTT7h\ / b-a.+ rnh'S 
< 3 -*-T Ah ' ^777-a Ah' 


Noting that b-<a=Nh,'f^t can express the two generalized bi- 
nomial coefficients of this formula by ordinary ones; indeed they 
are equal to 


/-A^y-rt7\//Vi‘OT 1 , rA-m-t-s 


AZj )f Zfjh 


so that we have 






zm+l 

\)z: f-i) 


a=rn 


s-t-l/s 's/’Znr^i) 
s^l A 




According to a known formula of Comhimlory Analysis 

(Netto, p. 250.) the last sum of this equation is equal to (~J) 

So it follows that 




m 


). 
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Moreover from (21) we deduce that 


Kow we have determined the values, at the limits, uf all the 
quantities hqunn,!,'- in equation (22) ; hence the definite sum will he 

It i.s useful til leniark that this exjne.s.sion is independent of 

z z 

the oii,i,nn ol the vatiahle x Let us note also that Z! = C ^ . 

To determine the coefficient from equation (15) il is still 
necessary to determine For this ])ur])ose. let us siart 

from equation (21'), that i.s from the AVre/wii-serics of with 
respect to the , generalized binomial coefficient , We have 

(25) C 

iterf v^U 

b 

In ihf analysis of continuous variables the quantity ya;^y (S'-?: 

d 

is known as the moment ofy of the degree (we will say 
the aA/? power-moment). We have seen that in analysing equi- 
distant discontinuous variables it is not advantageous to operate 
with powers, Imt that it is far better to express the quantities in 
hinomin! coefficients. Indeed if an expression were given in power- 
sen ies, we could not consider it to be a full solution, as it would 
still be advantageous to tiansform it into a binomial series. There- 
fore we have no use for power-moments and shall introduce 
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binomial moments. The generalized binomial moment of degree 
\/ will be denoted hy3^ and defined by the following equation 

(26) 

fC” d 

This can be easily expressed by ordinary binomial coefficients, 
for if we introduce a new variable ^«.(x-<x)/h , we have 

(27) (l)y. 

f 

As will be shown later in (§9) there is a far better method for 
rapidly computing the binomial moments than there is in the case 
of power-moments. 

Several statisticians have remarked that it is not advisable to 
introduce moments of high order into calculations. In fact, if N 
is large, these numbers will increase rapidly with the order of the 
moment, will become very large, and their coefficients in the 
formulae will necessarily become very small. It is very difficult 
to operate with such numbers, the causes of errors being many. 

To obviate this inconvenience, I have introduced the mean 
binomial moment; that of the \J-ih dtgrtt will be denoted by 
and defined by 



The sum figuring in the denominator is according to § 2 equal to 






so that T xJ will be 
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( 28 ) 



If we introduce the variable ^ mentioned above, we obtain 



Consequently the mean binomial moments are independent of 
the origin of the variable z and of the interval h . The binomial 
moment increases rapidly with /V and also with v* , though less 
rapidly than the ordinary power-moments. However, the mean 
binomial moment remains of the order of magnitude of y , what- 
ever /V or V* may be. For instance if y»/r, it follows that 7(y=4' 
for any value of v^. 

Substituting in (2S) the value of obtained in (28), we have 


L U^My^Ch 

Xod 




Z7 ( 

\J^O 


m*\A\ 
m 




To this expression we can give the following form, 





To simplify we shall write 


( 29 ) 
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These numbers are integers, and as they are very useful are 
Xrresented in the following table, which juescnls all the numbers 
necessary for parabolas up to the tenth degree. 


Tattle for f3m\A 



0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

T 

T" 

1 










2 

1 

-3 

2 









3 

-I 

6 

TO 

5 








4 

1 

TO 

30 

-35 

14 







s 

-1 

IS 

-70 

140 

-126 

42 






6 

1 

-21 

140 

-420 

630 

-462 

132 





7 

-1 

28 

-252 

1050 

-2310 

2772 

-1716 

426 




8 

1 

-36 

420 

-2310 

6930 

-12012 

12012 

-6435 

1430 



9 

-1 

45 

-600 

4620 

-18018 

42042 

-60060 

51480 

-24310 

4862 


m 

1 

-55 

990 

-8580 

42042 

-126126 

240240 

-291720 

218780 

-92378 

16796 


The following i elation can he used for checking these number^: 
^mo * ^ml ^rrjjP /^m m~^ ■ 


Moreover, let us put 
m-hl 

If we already know the mean binomial momcnt.s, the values of 
0,^may readily be computed with the aid of the table above. 
Finally we obtain 

<C»c2 


As this expression could be termed the orthogonal moment of 
degree 777 of y , we could consider as a certain mean ortho- 

''^/(3,7;,pand 0^ of this paper correspond respectively to 
(-T)^0ot the paper loc, cit. “. 
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goml moment of y of degree m . These quantities, as we shall 
see, are very important; they are independent of the origin, of 
the interval, and of the constant C . Thus, they are valid for all 
orthogonal polynomials. 

In particular ^ = .7^ = .^//V is the arithmetic mean of the 
given quantities y . 

Finally, equations (IS). (24) and (31) yield, after simi)lifica- 
tion, the formula fora^ , that is, 


(32) 


a 


rr> 




The coeflicient is indei>endent of the origin of x . In jiar- 
ticular if we have <3^c /C. 

Now the equation of the approximating paraliola is known, in 
the form of its expansion into a series of polynomials (13), 
and our problem is solved. But if it is necessary to cominite a 
table of the values of the parabola frt(^) corresponding to 

A-hh, a+^h, , the corresponding values of (x) must 

first be calculated by formula (21'). Although this .seems easy 
enough, yet if /V is large, the computation is a tedious one even 
with the aid of Table IV which presents the values of the binomial 
coefficients, for integral values of ^ .If it should be neces- 
sary to compute { fjior non-integer values of ^ , the calculation 
must be made in the ordinary way, that is, by multi])lication At 
all events the calculation would not be shorter if the L^(<»?7were 
expanded into power-senes. The labor will be decreased, how- 
ever, if tables giving the values of corresponding to 

■■■■ , are available. I adopted this procedure in my 
paper published in 1921 and later Essher, and also Lorents, did 
the same^*. 


^**566 he. cit ^ ^ 
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It will be shown, however, that these tables are superfluous, as 
by a transformation of formula (14) into a N ewfon-seths, we can 
get the required values still more rapidly by the method of addition 
of differences; and if an interpolation is necessary for any values 
whatsoever of x , Newton’s formula will give it in the shortest 
way. 

Moreover, by this method we shall be independent of the value 
of the constant C , that is of the orthogonal polynomial chosen. 

We will give in § 10 formulae leading directly to Newton’s ex- 
pansion, so that the computer will have nothing to do with the 
orthogonal polynomials themselves. He will only have to compute 
the binomial moments , and then deduce the mean orthogonal 
moments dnii which will give, with appropriate coefficients, the 
solution in Newton's series, 

§ 8. Determination of the measure of obtained approximation. 
The approximation is generally measured by the mean-square- 
deviation , that is by the mean of the squares of deviations 
between the parabola and the given values y . It is expressed by 




I b 


f„M'y 


If in this formula we put in place of f„(x.) its expansion (14) 
in orthogonal polynomials, we shall obtain in consequence of the 
condition, (13) of orthogonality. 



and on account of (13) 
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We can still simplify this result, since from formulae (24) and 
(32) it follows that 


(33) 






z 

m 




And if to abbreviate we write 


(34) C„, - '"„■')/( "r; 


and note that c^ = i , then the mean-square-deviation will be 


(35) y^-ef-c,0,‘‘-c^e‘-- 


r? Tt 


The coefficients can be easily calculated by (34), u.sing 
Table IV for the binomial coefficients. But we shall see that 
is equal to the absolute value of a certain quantity, which we have 
denoted by and which is given in Table III for values of /Y 
up to 100. 

As the mean orthogonal moments are already known, therefore 
to obtain cf^ it is sufficient to compute IZ y^. 

Remark. All quantities figuring in (35) are independent of 
the origin, of the interval and of the constant C , consequently 
this formula is valid for all systems of orthogonal polynomials. 

Sometimes it is necessary to know , the mean-square of 
deviations between two parabolas, one of degree n , the other of 
degree s , (where n >s), both approximating to the same values 
of y ; that is 
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If in this formula we place the values of fn(x) and ex- 

pressed in series (13) of orthogonal polynomials, we have 

fz)- fg (z) « ^s-f-Z ^TT^n 


and in consequence of (14) 

n j b o 


then, using foruuilae (33) and (34) we find that 


^ns - , 


Having obtained the equation of an approximating parabola of 
degree ??, it may develop that the resultant approximation is not 
close enough, the mean-square-deviation being too large. We may 
then pass on to a parabola of degree 77y-i by determining only 
the one additional coefficient a , — the others do not change. 
For this purpose we must compute and then : the 

coefficient will be given by (32). The new mean-square-deviation 
will be 






'nv-i 


Reiiwk, If the binomial moments >.fl^were 

given, and we proceeded to determine a polynomial of degree n 
such that its first binomial moments should equal respec- 
tively to the values given above, thus employing the principle of 
moments, we should reach the same result as though we had 
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imposed the principle of least squares. In the case of polynomials 
both principles lead to the same function. 

§ 9. Detenimiation of the binomial moments. Chetverikoff has 
given a very good method for their determination, which dispenses 
with all multiplication, We have seen that 



Chetverikoff' s method produces the last sum, that is the ordi- 
nary binomial moment {h=l) ; to obtain the generalized binomial 
moment -Sethis must still be multiidied by but it is needless 
to carry out this multiplication, since we need only the mean- 
binomial-moment T\) , which is given in both cases by 


The method consists in the following: Let us denote by )/(^) 
the value of y corresponding to ^ ; in the first column of a 
table the values of >f<^are written in the reverse order of mag- 
nitude of ^ , that is 

y( N-l), y( N-2), yf 0 ) . 


Into the first line of all the columns we write the same number, 
y(7V‘j). Into the \J-th line of column/z we put the sum of the 
two numbers figuring in line ^-J. of column/z , and in line \) of 
column /j -I , 

In columnyz we stop at the line the number figuring 

there will be EfJ.^Jy ! to obtain the mean binomial-moment of 
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degree /j-2 this must be divided If we want the mean- 

binomial-moments Tj, T^, T„, we must compute ?;// columns. 

An example is given in § 13, 

An elementary demonstration of this method is given in the 
papers loc. cit. “ and We will give here a more direct one. 
Let us denote by^i^^yi^the number written into the sJ-ih line 
of column//. The rule of computation will be 

(fif -/> (pf \X-1, m)- ’ 


This is an equation of partial differences of the first order which 
may be written as follows : 


( o<, ) tA/, // - (pf O . 


Let us solve it utilizing Laplace's method of generating functions. 

We will call u= generating function of PP ^/^^with 

respect to v , if in the expansion of a >n powers of t the coeffi- 
cient of equal to pfv,/x)^ where is a parameter of a • 
5ince O , then we have, 

• • • • 

From this we easily deduce the generating function 
If we divide both members by -t , the coefficient of if in the 
second member will Hence u('tyu)/t is the gen- 

erating function sought. Since // is a parameter, it follows from 
the preceding that the generating function of pf'J, be 
i and that of be . If in 
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equation (o<i ) above we substitute the corresponding generating 
functions for the function , we obtain 


(/5) (1-t) uft,/j+l)- . 


In this way we have reduced the partial difference-equation ) 
of the function (p to an ordinary difference-equation of its gen- 
erating function a . The solution expanded into powers of if , 
will give the function (p itself, which is sought. 

Equation (/5 ) is a homogeneous linear equation with constant 
coefficients, t being only a parameter from this point of view, and 
can be solved immediately, yielding 

(/) 

where uifi) is an arbitrary function of t ; and may be determined 
by the initial values, that is, by the values put into the first column. 
Placing /J. = l into equation ( we obtain the generating func- 
tion of these values. Hence, 

cDCV/fP-tJ = / y^oJ i 


Finally we have 

u = ■••■J y(7V- 


s=0 


( yU-2 > 


Since 
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we may o))taiii the coefficient of / in the expansion of u 
By letting s = v^-?we have therefore that 


If we place in this expression v= and we see 

that 


4)(h/-M*A m)- 

We conclude, therefore, that the number figuring in line 
of column yU will he the ordinary binomial moment of degree 
/i-y . It was this that was to be demonstrated. 

§ 10. Tronsfonnation of the orthogonal series into Newton's 
expansion. Since the approximating parabola and the mean square 
deviation are independent of the constant of the orthogonal poly- 
nomial used, it is natural to transform equation (13) so that it 
.shall also be independent of this constant. This can be done by a 
transformation into Newton's series. 

We have seen that the coefficients of Nezoton's series (4) are 


(a). 


To obtain this series, therefore, it is sufficient to determine these 
quantities, starting from the orthogonal expansion. 

Deriving from formula (13) thes-//> difference of f^(x) , we 
have 


77-^Jl 

aXnxj^z: 

^ 7 ) <5 


" 77 ? 




( 36 ) 
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To obtain , we start from formula (21'). Since f Z'-t 

is a multiplying factor of the s-th difference of we conclude 
that for z-'Si , all terms obtained from (21') will vanish except 
the term in which . Hence we have for 


Placing this value, and that of .3^ taken from (32), into equation 

(36) we get 

r^v - ricjf'" - 

m=s { y 

where is the mean orthogonal moment of § 7. To abbreviate 
we. shall write 

(37) C f M+m ) ' 

' 7 /? / 


Finally we find that 


( 38 ) 

For instance, we have for s=^CJ , 

Cao ^no 

where Qq =J, Let us remark again, that the second member of 
equation (38) is independent of the origin, of the interval, and of 
the constant C of the orthogonal polynomial chosen, 
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The importance of the numbers was first recognized by 
W, Kviatovssky, who calculated a table for these numbers. This 
table has not been published. The author’s Table III is more 
extensive, giving these numbers with more decimals for A' up to 
100, and for parabolas up to the .seventh degree. 

Having obtained, by the above method, the Newton expansion 
of the approximating parabola, it may happen that the expansion 
corresponding to a parabola of degree n-hl is desired, and this 
requires the calculation The coefficients of the new 

binomial expansion of Cx) are easily deduced from those of 
previously obtained, since 


The work previously done is therefore not lost. 

§ 11. Method of the addition of differences. Knowing the co- 
efficients of Newton's formula we can 

proceed to calculate a table of the values of fyj(k) corresponding 
to + A~^£h/-,b-hhy adding the differences. This method 

has been used by H. Henning in his remarkable paper on the 
Trend-lines.^' It proceeds as follows. The function 7^ being 
of degree 77 , it is evident that a constant. 

Into the first line of the first column of a table we shall write the 
number Into the other lines of the first column we 

put the number of the preceding line of the same column, increased 
hy We stop in this column at line /V- r 7 -f-J . Accord- 

ing to Newton's formula, we have 


!■*//. Henning, Die Analyse von Wirschaftskurven, Vierteljahreshefte zur 
Konjunkturforschung. Berlin 1927. 
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or 

^3 f„ M. ^ ri)-f "f„ Ce ) 

where (x-a)/h\ hence the first colmnn will coiitam the valhes 
of differences of It is advisable, before con- 

tinuing, to check the last number in this column by the formula 
above, putting therein <f= N-v . 

Into the first line of the ju.-'th column we write the number 
^fid into the \)-ih line of the same column the sum 
of the numbers figuring in line \)-i of column and column 
/u . The computation will be stopped in this column at the line 
N-r)+yU. The column will contain the values of i'c\t(v-ji^4h 
differences of , which follows from Newton's formula 


or 

(39) 

Before going on, the last number of the column/^ should be 
checked by the preceding formula, putting into it N-yt+ju-i . 

We continue in the same manner, — the last column to be 
computed is the rt-th^rA will Contain the values of 
. . . , f^(b-h). This last number can be checked by formula (39), 
by making the substitution//: '•r? An example is given 

in § 13. 

§ 12. Recapitulation of the operations. To solve the problem 
of approximation of § 5, it is necessary first to compute the mean- 
binomial -moments ^ 7], “’,7^. This is done by drawing up 
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Chetverikoff's table (§ 9) and dividing the number in the line 
column// ■ We obtain in this way 2^.^: this 

must be repeated in every column. 

Then the numbers are taken from Table L, and the mean- 
orthogonal-moments , ^,-”^are calculated by 


(30) 

Then is computed. The numbers are taken 

from Table III, or if this table fails, calculated by formula (34) 
and by Table IV which gives the -binomial coefficients, The mean- 
square-deviation is calculated by formula 

( 35 ) cs„^‘tyfri -ey-c,eA* e„‘ . 

If this quantity is conveniently small, the approximation is con- 
sidered close enough; if not, we proceed to calculate , and 
and so on until a sufficiently small mean-square-deviation 
is reached. 

Now we proceed to deduce the Newton's expansion of the 
required function The numbers are taken from Table 

III, or if this table fails, calculated by formula (37) with the aid 
of Table IV. 

The constants of Newton’s formula are given by (38) 

1^/ O/? ^no 

^ ^77 (q)= C„i 
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n2 






ss 




c. 


ns 




A”f„M.c„„ e„ 

Now the equation of the parabola is known in the form of its 
Nf’zc'ton-series 

( 4 ) 

wheie ^ r=: Cj^-a.) /h- This will be considered as the desired solu- 
tion, As has been said, it is quite useless to expand the parabola 
in powers of . 

If it is necessary to deduce a parabola of degree t?// starting 
from equation (4) ; the new coefficients will be 

^ 77^1, s • 

Generally a table of the values corresponding to the parabola 
and iQ x=a,^*h,--,b~hy is needed; this will be computed by the 
method of addition of differences (§11). The last column will 
contain the required values. 

§ 13. Example 1. Let us choose an example given by 
Lorcntz'-’^ in which six values of y are given, and where yV»<5, 


^“loc. cit. p. 21. 



m APPROXIMATION BY ORTHOGONAL POLYNOMIALS 


/?=/? and a=-5’ . The approximating parabolas of degrees 1, 2, 
3, 4, and 5 are required. 

The values of y corresponding to x= y, are 

written in the first column of the table below in reversed order of 
magnitude of x . The other numbers of the table are obtained by 
Chctverikoff's method (§9). 


12293 

12293 

12293 

12293 

12293 

12293 

1087S 

231fi8 

35461 

47754 

60047 

72340 

10058 

33226 

08687 

116441 

176488 


10018 

43244 

111931 

328372 



8530 

51774 

163705 




7880 

59654 






The required mean-bmomial-momcnis will be 

z -yp6y4/6^994y, * T, -J7646S/J^=my,&66T 

T, 6667 z - 77740/6 - 170/6, 6667 
Z -776/77/70- 11416, 6 T, - 7779/ 

The nuan-orthogoHal-inomcnts arc 

9947,763^ 

971, 7736 

-77 ^ 7 ,^ 76,7773 
= 7Z -70Z +6T, -T, - 187 
- 14Z -y7Z^70Z -lOZ ^ Z = 666 7 

&s -- 47T^-176Z^140Ts - 70^ ^17Z -T, ^-11 07 

The squares of the mean-orthogonal-moments are 

98849967,46 = 77489 

e" = 94 7468, 76 ©/= 177669,4/ 

1484, 67 ej - 177 7104 

HyVe - 100960410, 7 


♦Editor’s note. In this country we usually write g * 9942. J^jfJinstead 
of 9942,3^31. For the purposes of this paper I prefer to use the con- 
tinental notation appearing in Professor Jordan’s manuscript. I agree that 
the following typical product of three factors ^ ■ 34-1, 77393 ' f / 
appearing on page 300 is less liable to be confused than if written 

l’341.77'795//y^). 
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From Table III we get: 

C^^O,Zi42d^71 
= i, 76771429 Qo -0^02380972 

- -o.a^m}}} 

Now the mean-square-deviations for the parabolas of degree 
0, 1.2, 3, 4, 5 will be: 

= 2110424, dZ 
65664,05 
4 - erf- C, Of = 66012, 36 

^ - 36105,03 

al - cff - - 61604,49 

6,80 

As the parabola of degree S passes through the given six 
points, therefore (3f should be exactly equal to zero. 

Let us now proceed to the determination of Newton's formula 
corresponding to the five parabolas required. It is to be observed 
that the amount of work required to solve this problem, is inde- 
pendent of the number of observations. First the following num- 
bers must be taken from Table III : 

C,r-0>657142B6 1,07142857 

Ogf =-2,14285714 - '2,3 Css - 1, 

0,,=2 €,, = 1,92857143 O 

C,, = -0, 65714286 €^,= -0,5 Cg, - 

Q,- 0,14285714 €,, = 1,66666667 
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Formulae (38) of the preceding paragraph give: 
f, r<^)- a ^ Of = 7S60, 904 30 

^ = 7929,71427 

f/a)= f^(a).C,^Q^ = 7777 .21427 

fja) = f, 7a Jf = 7d^2 ,07142 

f/a) = Ua)^Q^B^ = 7779. 99996 

^ B) - = 832. 37140 

= = 730,00003 

Af,Ca).Af^ (ah Oj,Oj = 11J6.. 00003 
Ai/a)=73f/a)f-C.^,0, = Q14.37114 

Af/a)=Af/a}*C^,0^ “ 630.00002 

^ 41,23360 

^-416.21432 
77f/a)=77f-/ahC4^G^ = 262,00003 
A-^-f/a) =Z)^/^ 7a) 0^ - 833,0000 3 

7a) > e'j = 303, OOOOO 

7a). A =- 703 , 00040 

A%7a).A^f^7a)^O)^^0^ =-2236 00010 

37 f„ 7a) = < 9 ^ = 1 053, 00010 

■37f^7a)-A'^fJa)^q.^0,^= 4511,00010 




-~3912 
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Bcforfe writing the formulae of the parabolas, let us introduce 
(x-cL)/h=:Z and put we have then 

7660, 905*862, 571 (F) 

7929, 71i*750ff)*41,8S6('p 
7^70- 7777, 2i4-*lJ16ff)- 4J6,2M7f)*3057f) 
17(7), 7852,571 '814,5T}(f)*262/f),T05(f}* 1055(1) 
17(0- 77a0*650(f)*85a(l)- 22e6(()*451J(P 
- 6912 (p 

These lesults are exact to three decimal places. 

Ihe method of the addition of difierences can he aiiplied imme- 
diately to these equation. 

Exaiiipc 2. The values corresponding to the appioximatmg 
parabola of the fifth degree and corresiionding to the given values 
of a: in the preceding example are to be determined 

Starting from the last equation above, we will determine 
for ^ ^ ^ Noting that A^FfO) - £>9^2 the table 

below IS obtained by using the method dcsciibed m 11 The last 
column contains the required values of which, as in this 

case the parabola passes through the given points, should be 


exactly equal to the given 

values y in 

Example 1 


4S11 -2i8r, 

8.1S 

650 

7880 

-2401 2225 

-1448 

1488 

85.?() 

-176 

777 

40 

10018 


601 

817 

10050 



1418 

10875 




1229,1 

d'lie results are exact to three decimal places. 



Remark on the number of decimals to which the calculations are 
to be carried out If for instance a parabola of the fifth degree 
is to be determined approximating the values of y , which are 
given with a preci.sion of half a unit, then should be cal- 
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cuiated to seven decimals if the number of the given values is 
near 50, or to eight decimals if it is near one hundred; A f(a) 
should be calculated to six or seven respectively; fa) to five 
or six, and so on. The corresponding orthogonal moments and the 
numbers must be of course calculated to the same number 
of decimals. 

Example 3. Changing the origin. The given values y in 
Example 1 correspond to 

JulyJjgOl 

tor O, I,2," where t, is expressed in years. These values 
have l)een approximated hy F^f'^), obtained in the last equation 
of Example (1), to abbreviate we shall write it F(^) . In this 
z&5t,i/u/yT,J90I corresponds to a It is required to develop 
the function Ff ^^in a series of binomial coefficients ( ^\f)> 
where c = corresponds to JANUAry iJ901 . Equation (7) 
will give for hsl and €-<3*-^ with an exactitude of three 
decimals 

4$11 ^§-6912 - 7967 
A'FHJ- -4^S6 -i -§ • 6912 = - 7135^5 
A^Ff-j)= -2236 • 4pll -6912=3632,62^ 

AF7-j) = 690- i • 638 -§■ -2286 -jj • 4.311 

-j§Q- 6912 ^-3923, 938 
F(~i) = 7780 -i ■630 x§'838 ‘2266 

^M6-7>912 = 11418, 102 
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Finally the required formula will be 


- n4JS, lOZ - 39Z5. 383Z, 6Z7f^^J) 

- rm 7967l'^lh-69izC^*h ■ 


This equation was checked by calculating F( O) which was 
necessarily equal to 7780. 

Example 4. Changing the interval. Let us suppose the follow- 
ing function given 


f(x)^77eO* 


650yrX+5 \ 


.636 /■ ^-^7 ] ZE86 X x-i- 7 \ 
‘ 2 ^2' 3 



This IS a A^ewfOK-expansion in which the decrement of the gen- 
eralized binomial coefficient and the increment of the differences 
are both h=Z. It is required to deduce another Neit/ion-expansion 
such that the mentioned decrement and increment should both be 

For this purpose it is necessary to calculate 7F-f(x)ivt x=-S 
&tid.~s= 4, 7 ■ Fitst we must determine the numbers 
introduced in §4. 

For /7=2, and k=4/3y^t have 

a' A= 1/6 7/7Z 

aI 77/1Z96 A^Jh\-937/3no4 

21707/933120 
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aI I 1/36 


Al/h">^l/ZJ6 
Al/h^^ 169/31104 


aI/h^- - 9/216 
A^/h^^ - 99/3496 

Ay - 9 /d64 

Ayh'^’’ 1/1296 


At /h - 5/9368 A// h 1/7776 

Now we can proceed to calculate the differences A ) given 
by the formula in § 4. We have 


A • 690 - ■ 636-^^^^ • 4286 ' ^9'13 

■ 636 ■ 2386 • 49 li 

271,76190 

A%).±.iie6-^f^.4sa-If^^-69ie =- T9977 

' ^ddd ■ 36960 

A^f(a) = - = _ o, 68869 

Hence 

f(x) = 7780- ^341, 77395/ 71, 76190^/) 
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-jp,(77,r997rX%^)t^ (K,3696f*/) 

-j^,-fO,88dd9X‘‘X) 

where k^l/3. If we want to apply the method of the addition 
of differences it is preferable to change the variable by introducing 
t^()c-A)/k and writing F('^)^i(a+^k). We have 


Fff)^ 77SO-341, 77 39 3 371^ 76l9(^) 

■I ^ 

- 77, 79977(f) ^13,3696(0- 0,83689(1), 


Of course it would have been better to change the variable before 
beginning to calculate the numbers , but we wanted to show 
the method in its generality. It is advisable to check the above 
equation by putting into it d : the result must be F(6j<= {(l )- 

3430. The checking has given this number with a precision of 5 
decimal places. 

Starting from A^F 70 =- 0,8886^ , let us determine the first 
values oiF(fJiox 0Q>)8,8, •"( by the method of the addition 
of differences 


12,36960 - 77,79977 271,76190 

11,48071 - 65,43017 193,96213 

10,59182 - 53,94946 128,53196 

- 43,35764 74,58250 

31,22486 


- 341,77395 7780 

'• 70,01205 7438,22605 

123,95008 7368 , 2140 (> 

252,48204 7492,16408 

327,06454 7744,64612 

358,28940 8071,71066 

8430,00006 


§ 14. First problem of correlation. It is required to determine 
the coefficient of correlation Prim between the deviations fr,(^)-y 
of a parabola of degree t? approximating to the values y (§3) 



.•502 APPROXIMA TION BY ORTHOGONAL POLYNOMIALS 


and of the deviations y' o{ a parabola of degree 777 ap- 

proximating to other values y' corresponding to the same a: 
quantities. Let us suppose that v t >7?. 

The definition of this coefficient is the following : 


(40) r, 


nm 


b 

~ £ 
YV xxd. 


i 






where cr^'^ is the mean-square-deviation of and y (§8); 
and the mean-square-deviation of /^/V^and y'. 

Let us put into (40) the values of and expanded 

into series of orthogonal polynomials (14); then the sum in the 
second member of (40) will be 


(+ 1 ) A:yy'-ZyU,u^^Au,^-.^'^uJ 

’•Ly‘(d^U^ 


This expression may be simplified ; indeed, starting from equa- 
tion (15) we have, after multiplication by a.^ 




Putting into this equation successively .s -■ L? •^and adding 

the results, we obtain the values of the second sum of (41). To 
have the third, we start from the equation analogous to (IS) 

tyV, . a; tv} 
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and multiply both members by <3^ ; putting successively 
• , n and adding the results, we obtain an expression for the third 
sum of (41). Finally this formula will be 


Syy'- f 


We have still to determine the quantity / N ; but 

according to (24) and (32) this expression is equal to C , 

j ‘ ^ m m' 

where 0^ is the mean-orthogonal-moment of degree m of y' . 
We conclude that, 


<42) r 


r? ?77 




Sometimes, n being given, the coefficients of correlation 
are sought for all values of 7n^OJ^2^ ??.- In this case we will 

first divide the quantity within the brackets in equation (42) by 
C2„ and then divide the quotient successively by the values of 
rn = OJ,2, ■■■, n 

All the quantities figuring in equation (42) are known from 
the previous determination of the approximating functions 
and except 2Jyy//V. To obtain the desired coefficient of 

correlation, it is necessary to compute this additional last quantity. 

Formula (42) also shows the importance of the mean-ortho- 
gonal moments of the given quantities y and y^; it is independent 
of the origin, of the interval, and of the constant of the orthogonal 
polynomial chosen. 

The most important particular case of is ; this being 
the coefficient of correlation of the deviations of y and of y ' 
from their respective averages. Thus shows the simultaneity 
of the variation from the respective averages. Another important 
particular case is , which gives the correlation between 
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and thus measuring the simultaneity of periodical devia- 

tions from the respective linear trend-lines. 

If n and m are large, the a]jproximating parabolas will follow 
the principal secular and periodical variations, will therefore have 
nearly the same maxima and minima as the values y and y' . 
In this event the remaining deviations will he mainly due to chance, 
and thus the coefficient of correlation loses its importance. 

§ IS. Second problem of correlation, Given the functions 
and of degree n and \/ resjiectively, approximating the 

values of a quantity y . Let us denote by ^ the deviations of the 
two corresponding parabolas: 


Moreover, let us likewise have given the functions (jc) and 
f^(») of degree W and/i respectively apiwoximating the values 
of another quantity y', and denote by 79 the deviations between 
the corresponding two parabolas. 

It is required to find the coefficient of correlation myj. 
between the deviations ^ and r) . According to the definition of 
th coefficient of correlation we have 


(.^' 5 ) m/u: 


1 


b 

L 



where denotes the mean-square-deviation of f , and 
the mean-square-deviation of r) (§ 8 ). Both are known from 
the determination of the approximating parabolas. 

Let us suppose that n't m>^ = ju- Substituting, for 

, f^(x) and their expansions in orthogonal poly- 

nomials (14), the sura in the second member of (43) will be 
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r] 


^n)(< 


^M+1 ^yW^i ^ 


-f- c5 




m rr? 


This yields, in consequence of the orthogonality of the poly- 
nomial Ug , 






As we have seen in the preceding paragraph, 

^a^a‘LU/.Cy0,&J. 


Hence 

'"n\J, tvm" a \ja ' 

V ^rvju . '- -I 

All the quantities of the second member of this equation are 
known from the previous parabolic approximation, so that the 
calculating of the coefficient (44) is easy. We note that it is a 
simple function of the mean-orthogonal moments, of the mean- 
square deviations and of the number . (formula 34 or table 
III). 

If m and jj. are g'ven and this coefficient must be computed 
for several values of n and \/ , we first divide the quantity within 
the brackets of formula (44) by<3j^^ and then divide the quotient 
successively by the different values of 

In the second problem also, the most important particular case 
is that of especially if n and m are large, in which case 
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the deviations f^(xJ-y'zm\A be considered as negligi- 

ble. In this case the coefficient of correlation (44) of the trend- 
lines is much more important than the coefficient of correlation 
(42) of the trend-deviations. 

Example on correlation. A, Sipos has determined trend-lines 
up to the third degree of Hungarian impohs and exports in 1882- 
1913.'“ The mean orthogonal moments for imports were 

0^ = J2U, 2^938 Qz - 70, 63941 

e, - ^06, 02671 Oj - 21, 27^41 

The mean-square-deviations corresponding to the parabolas of 
imports of degree 0, 1, 2, 3 were 

Cf^ * 166, 317 69, 930 

The equation of the third degree parabola of approximation was 

0)-d64, 12484 ^20, 3d 562^^)- 2.82064(^)^0, 49904{^) 


where X’=0 corresixmds to 1882 and is given in million 

gold crowns. 

The corresponding values for exjrorts were 


0)^1234, 4 
e',^ 192.679 

cr)^ 37987 
cr/* 96,662 


Oz- 37,80645 
ej- 5,98519 

cx) ■ 58, 480 
a)’’ 57,164 


iM. Sipos, Praktische Anwendung der Trendberechnungimethode von 
Jordan. Mitteilungen der Ungarischen Landeskotnnttsaion fdr Wirtachafti- 
atatUtik und Konjunkturfor&cnung. Budapest 1930. 
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The equation of the third degree parabola of exports was 

8n.24i05^15. 7^^). 

First problem of correlation. Let us determine the more impor- 
tant particular cases of the coefficient of correlation between 
the deviations y-f„(xJoi import and the deviations y'- 
export. 

The coefficient of correlation between the deviations of the given 
quantities and their respective averages is 

77 ^yy ~ ^0^0 > 

and since ^Lyy'=167^'^ J7, 60 we have 

9536 

This correlation is a very strong one. 

The coefficient of correlation between the deviations of the given 
quantities and their respective linear trend-lines is 

r,,^^,[j;yryy'-0,0'-C,e,e;^^ ^ 0,7669. 

(The number C/ was taken from table III. Cj * 4 8i618- ■ • ) 
This correlation is still strong enough. 

The coefficient of correlation between the deviations of the given 
quantities and their respective third degree trend-lincfl is 

'■« ■h’ \k^yy'-^<, 4- c,9,s;-Qqeyc, a, a ; ' 

3 3^ 

^qi739. i 
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This correlation is already very small, the deviations are mainly 
due to chance. From Table III we had -4, 14438^03^x16 
C3= 4.8a746663. 

Second' problem of correlation. The coefficient of correlation 
between the deviations of two trend-lin^s of import of 
degree n and respectively and the deviations of two trend-lines 
of export of degree 777 and respectively, is to be determined. 
We will only consider the most important particular case, the 
correlation between the third degree trend-lines and the respective 
averages, that is 

0o,3O ~ Z \pi 1 

JO 


where 


14Z639, ^^03 

and 

c , g\ ^ c, 110693, 9433. 

And therefore • 377,7023 b and 332^71000. 

The quantities in the brackets have already been calculated in 
the first problem, so that we have 




O, 9626. 


The obtained value is very near to that of r^o calculated above, 
proving that tfie third degree trend-lines well represent the given 
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quantities. This would not be true in the case of the second degree 
trend-lines; in fact we should obtain 9565 widely dif- 

ferent from obtained before. 

§ 16. SomC' mathematical properties of the orthogonal poly- 
nonmls, Symmetry of the polynomial , If we substitute in 
formula (18) a+b-h-x. for aj , we get 

~ch 


and putting s^rr?-/!, it follows that 


( 45 ) Urr,ra* b- h-xM-1) 


'J'liis formula shows the symmetry of the polynomial. 
Let us consider the particular case, 

x-a =^f'b-d-hj = kh. 


We have, then, 

b-d=: (2k+i)h , N~Zk+l 


x-A ~ kh 


x-b = -C k-fj)h. 


Liom (45) it follows that 

u„u^i<h}.(-jr u„ra*kh). 
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Hence we have 

From equation (45) we easily obtain 


and 

/fUrr, ( a-I-b-sh- x) . 

Function-equation of U^fx). This can I)e deduced in the fol- 
lowing way: let us develop a; into a series of orthogonal poly- 
nomials : we find that 


(46) 

as, in consequence of the orthogonality of these polynomials, the 
other terms vanish; indeed, if// is dififerent from m-l , m and 
m+l, it follows that 

z xU^u ^o. 

Jt* d ^ 

Since equation (46) holds for every value of x . , and U^(x) 
is known for three particular values of a: , we can determine the 
coefficients 

We know these values, since equation (21) gives for 


”As C can be dependent olm, we will write CY^n^) in the (ollowlna 
formulae, instead of C , 
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and for 35 • a , after changing a into b and inversely 


Moreover, in consequence of the above-mentioned symmetry of 
the polynomials; we have 

UJb-h).C-irUrr,(i). 


The two last equations give by using formula (46) 

(47) aU^f<3) ~ ^rn-1 
and 

(b-h)UJb-h).A„.,C/^/l,.hhA„ (b-h) 

Multiplying both sides of equation (47) by adding 

it to (48), we find that 

{h^a- h) a-h) . rb-h) 

and 


A^^jr(bi-d-h), 
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Multii)lyinR both sides of f47) by subtractin.t; it from 

(48) yields 


(b-6-h)Urpfb-h)’= iUr„.i(b-h) + z / 


Since 


U„ fb- h). Cfm)h Y ‘'"b '’) 


we find that 


^(b~a-h)C{m)h , Cfw-V 


(49) 


i-A 


rn + 


, - {b-a-rnh)(b~a-rnh-h ) 




As, in consequence of the symmetry of the |)olynumials, we have 

Y (b)-(-iruj^-h]-. crm)h Y 


hence we can deduce two other equations analogous to (47) and 
(48), u'., 
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and 


After multiplying the second equation by subtracting 

the result from the first, we obtain 




or 


J M b-dfwh 

(aO) j(b~<3+h)C(m}h 


Tn 




■^A^rn^l CCm+0 b 


^ (^t>~d+wh)(b-d-+rnh+h) 


ynfrn-^J) 


Finally subtiactmg (50) from (49), and simplifying, we hare 


A 


f r>-hi)^ C fm) 


hCfw-bi) 


^Ve deduce from (49) by substituting therein the above 
alue for , yielding 

hCfv?) 

C(m- 1) ' 
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Now the function-equation (4C) is known. 

We might have proceeded in another way. Having obtained 
A^hy using the equations (47) and (48), we could determine, 
for instance in the usual way hy multqilying both mem- 

bers of equation (46) liy , and summing x fiom c 2 to ^ , 

Since alread) known from (24), we need only 

determine the lirst memher and this may be done by applying 
formula (3) to the quantity 

Dljjm'ncc-cquatlon of U^fx). We will start from 


According to equation ( 1 ) we have 



Therefore 
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Appljfing to this expression formula (2), giving the TD-f-jth dif- 
ference of a product, it follows that 


(51) 

-mil 




Now let us deduce a second formula for We can 

write this quantity in the following manner 






Again using formula (2) to deduce the.(m-^2}-ih difference of 
the preceding product, we have after simplification 




'■( 3hyd-b)AV^ ■* {mi-i)(m*2)h^ J . 
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Finally, taking into account (51), this equation results in 
{ x-a+Zh)(z~b-h^h)A'^U^ 

'52) 

- TTif m-hi) » O. 

This is the required difference-equation ; it is a linear equation 
of the second order and can be solved by Boole's method.’® If we 
put ^^(x~A)/h the solution will be 



This expression of diffeis from those we obtained in par- 
agraph 6. 

The roots of fiJaO. L. Pejer''" has demonstrated the fol- 
lowing theorems concerning these roots; 

The roots of all real and single, they are all sit- 

uated in the interval 


dj b-h 


Whatever ^ may be. in the interval a+^h-t-h there is 
at most one root of 

V„M.O. 


Fejer showed moreover that if g^O'') is a polynomial of 
degree in, and if in its Newton expansion the coefficient oi(^) 


^^BooU, Calculus o£ Finite Differences, 1860, p. 176. 
*“See the Appendix in loc, cit. ” 
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i’.s equal to unity, the polynomial which minimizes the following 
expression 


is the orthogonal polynomial fx), with the constant C suitably 
chosen.* Indeed the first conditions of a minimum are that 


b 

L 

Z=:a. 




for 


\J< m 


and these are identical with the conditions of orthogonality (14) 
The second condition of the minimum is always satisfied in these 
cases, as has been shown in § 5. 

§ 17. Graduation by orthogonal polynomials. Let us consider 
an odd number of consecutive values of x , s>vy Zk+i , and the 
corresponding values of y . A smoothed value of y is wanted 
for the central term, viz. for x=cL-f-kh. This will be obtained by 
determining, according to the principle of least squares, a paia- 
bola of degree m , so that the sum of the squares of deviations 
between the parabola and the points x^y shall be a minimum. 

The equation of the parabola expressed in orthogonal polyno- 
mials (13) will be 

+ (x)-i- 


and the smoothed value required is given by (d+kh). 

In consequence of the symmetry of the polynomials, formula 


*See Essher’s first polynomial in §21. 
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(45), we have ^ equation of the 

paralwla will give 

Cai-kh)=> a^+ fa^kh)-)- kh)* • • • 


From this formula we see that it is useless to consider parabolas 
of odd degree, as for instance a parabola of the second degree 
will give the same smoothed value for y as would a parabola 
of the third degree. Therefore we will consider only parabolas 
of even degree, 

The values of r^^/^v^Vare given by our formulae (18), 
(21), etc,, but we can obtain a much simpler formula for them, 
starting from the Function-Equation of Cx) given by for- 
mula (46), which, since a+kh . we can write in the fol- 
lowing manner 




Um-i 

Cftri- i) 


•j- ^ ) 


Wm) 

h^Cfm) 


, Urrj^.1 _ 

This holds for every value of x, . For z- a-ikh the term in 
will vanish and we have 


(a*kh)=- 


2ki'l-m 2k^l*m C('w*i) 
m+1 ntfl C ( m-j) 




This equation can be solved by putting into it successively fn^^lAA" 
It follows that 
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Ugfa-t-kh)=> ~ k{ k-f-l)Cf^)h* 

U,faM).(^)(''f)ca)h^ 


and so on 


u,jaM)-r-jrh cun,)(;^)( 


For e?^^we have found m §7 


a 


{ 4 mil ) O. 


zm 


Zm 


X 777 ^ 


Hence if to abbreviate we write 


(54) - C-ir(4^*l)('l)^! 


^l!kili-2TTJ ^ 


m 


we have 


^Zm 


and finally the required smoothed value of y is given by 
(55) f^^faikkj- • 
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This formula is also independent of the interval and of the con- 
stant of the orthogonal irolynomial used. 

The values of iiece.'-sary up to paiaholas of the tenth 

degree and up to 29 ordinates are given in Table II, so the cal- 
culation of the graduated value is very simple. .\11 we need do 
is to compute the mean binomial moments by Chetverihoff's meth- 
od (§9) and calculate the corresponding mean orthogonal mo- 
ments by formula (30). This must of course be repeated 
for every value which is to be graduated. 

Example 7. Xine iioint graduation, employing second and 
fourth degree parabolas. 'I'he given value.s are 


% 

y 

X 

y 

0 

2502 

5 

2904 

1 

2548 

6 

3064 

2 

zm 

7 

3188 

3 

2675 

8 

3309 

4 

2770 




The mean binomial moments weic calculated by the method of 
§ 9, and were found to be 


2dJ9, 66667 
T, - 30 i 4, 97222 
T^- pir, 16667 


Tj-- 3182, 21429 

3229 , 87902 


Hence the mean oithogonal moments are 


6 ', = = 2839, 6666 7 O,,- 141, -33 ^307^ - JO T, 


e;,‘‘2T^-3T^*T^‘ 29, 08336 33146 


Fiom Table II. we take - 1,61618182 and 8,^.1,13266713. 



Table II. Gkadua'uox. 
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Therefore the smootheil value with a parabola of the second 
degree will lie 


^T<36, T<3 


In the paper loc. cit. " fp. 31) these values have been approx- 
imated by a parabola of the third degree, the value obtained for 
f^(4) was fp. 11)176^,787 63 to what has been said, 
this value should be equal to the obtained results for (4) . 

Finally the smoothed value corresponding to a parabola of the 
fourth degree i.s 

f^(4 ) * ^ (4)^ 3^6^ * 0037 


JlrBUOGRAPHlCAL AND HlSTORlCAI, NOTES.“" 

§ 18. It was Chehisheff who first introduced orthogonal poly- 
nomials with respect to a discontinuous variable. He*‘ especially 
treated the case of non-equidistant variables, from the mathemat- 
ical point of view, in a very interesting manner, but his results 
were necessarily corqplicatetl. As we consider here only equi- 
distant variables, this paper will not l)e discussed. 

But Chebisheff also investigated the case of equidistant poly- 
nomials in two of his papers. In the first "Sur I’Interpolation par 
la .Methode des Moindres Carres"^- he denotes the orthogonal 
polynomial of degree m by \ the variable x taking values 

differing by unity, from 7 TV-J) io J-7 TY-l) inclusive. His 
polynomials can l)e obtained for instance from our formula (21') 
by putting therein fri, <as-j( OZ-i CfrnO^ ■ I*'ormula 
(24) will give by putting into it h^i and 

where tt?./ stands for i,2j3i " w. 

““The numbers of the formulae quoted refer to the present paper. 

“'Sur les Fractions Continues. 18SS. Oeuvres, T, I. p, 201. 

“*Oeuvres 1859. Oeuvres, Tome I. p, 474, 
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In the second paper “Interpolation des valeurs equidistantes” 
in which he introduces such polynomialsr® he denotes the poly- 
nomial of degree m by (fi^C3c)w^^ the variable x takes the 
values 1, 2, 3, .... (n-i). These polynomials can also be obtained 
from formula (21') from (24), by putting in them 

/7.i) <s=i, /y>•^7-ian.d C = ('m!)^. 

§ 19. 7. P. Gram utilized orthogonal polynomials foi gradua- 
tion according to the principle of least squares.^* He denotes the 
polynomial of degree m by , the variable x taking the 

values 

■■■■.- lOj, ... 


where /V is an odd number. Then writing 
\j/'^(x)=L V. 


he determines the coefficients by foimulae 

Z! /x)=0 tor 0= s<m, 

/ t ''' 


There are m such equations and rn+1 unknown coefficients 
odg one of them therefore will be arbitrary. Gram disposes of 
the first coefficient which is different from zero in such a manner 
that all the values of corresponding to the considered 

values of x shall be integers and as small as possible. 

Tor instance in the case of , it follows that O , and 

in order to have = x , ht puts ^ - i 

Practically he uses only polynomials of the first, second and 

‘.**1875. Oeuvres, T. II. p. 270. 219. „ „ • 

s^Ueber partielle Ausgleichung mittelst Orthogonalfunktionen, Bulletin de 
1’ Association des Actuaires Suisses, 1915. 
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thiul degrees; and gives tables for and available 

for /V ^ li, and containing the corresponding values 

of 


The values of ^ obtained, for instance, from our 

formula (21'), if we put therein h=l, N-l) . and either 

C~-J , if n~lJ(N-2)\^ not divisible liy three, or <r='- j ii 
it is so divisible. 

The values of are also obtained from the same formula 

by putting into it h~l, ci^-^C 1^-1 ) and either C= - , if 

( P^'ZX N-3) is not divisible by five, or C-- ^/zo , if it is so 
divisible. C-^z corresponds to Xi fx:) ‘ n, . 

Formula (24) would give the values of 27 

of the table, if we were to .substitute /?«/ , and for C the 
respective value, s above-mentioned. , 

In order to give an e.Kample, Gram culculate.s the smoothed value 
corresponding to an eleven-point parabola of the second degree 
(p. 12), Ills calculation is short enough, but our general method 
IS still shorter, as may be judged from the following determina- 
tion of the smoothed central value of his example. 

In the first column we have written the values of y corre- 
sponding to a reversed order of magnitude of x ; the other col- 
umns have been obtained by simple addition, as Indicated in § 9. 


194 

194 

179 

373 

212 

585 

124 

709 

780 

1489 

000 

1489 

504 

1993 

244 

2237 

OOO 

2237 

582 

2819 

000 

2819 


194 

194 

5d7 

761 

1152 

1913 

1861 

3774 

3350 

7124 

4839 

11963 

6832 

18795 

9069 

27864 

11306 

39170 

14125 

... 
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7*0 = = ^3/9/ii ^^6, 272/2Y, . . 

T, = 1412’}/^^ ^ 256,81618... 

^ » 391 70/16 5 = 237, 393939. . . 


and 


e^-2r^-3r,./, ^-39,3939.. 


T(j lia\e the smoothed value, we take from Table 11., 

1,923 0769 4 

iiid It follow .s that the required smoothed value is 
kh). ^ vX, ^ = 332, 0303 
cijjieeiug with Graiii'i result 

.Mthougli the calculation has been executed to nine figure.s, it 
Is a \ery short one. 

§ 20. Ch. Jordan “Sui tine serie de polynomes dont chaque 
'Oinme partielle represente la meilleure approximation d’un degre 
donne suivant la methode des nioindres carres.”-“ (1921) In this 
paper the author treats the mathematical theory of orthogonal 
polynomials for equidistant values in the general case. Many of 
their mathematical properties are demonstrated; formulae, differ- 
ence-equations, function-equations of these polynomials are given 
and some interesting propositions concerning their roots are dem- 
onstrated. Two particular polynomials were introduced. In the 


‘“Proceedings of the London Mathematical Society, Vol. XX„ p. 297. 
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first, denoted by ^ takes the values a, a + h,a-f2hj---, b-h, 

where b’«a + Nh \ and N is an integer. 'The constant C was 
chosen as 

rr,!/^'”h 

This was done, as ha.s been mentioned, so that for am-i , b-*l 
and h=0 the limit of the [jolynomial .should be identical 

with Li'(]cndrc's polynomial. 

The .second particular ca.se denoteri by was obtained 

from by putting . a=.0 and There arc tables 

in this pa][)er giving the values of 4j 5, for 

for the values of x= N- 1 . 

Moreover there is a table giving for 777 = and 

for A^' mi-1, mi2, • • ■ 0 

'I'he problem of appioximation was solved in this ])ai>er by 
formula (14) of the present work in the following manner; — the 
coefficients were calculated from formula ( 13)above:r>-<;^(i} 
was computed first by multiplying every value of y by the cor- 
responding values of taken from the tables mentioned 

and then the products were added. The quantity was 

taken likewise from the tables. 

The coefficients < 2 ^ being known, the mean square deviation 
was calculated by formula (32'), And finally, the required values 
of frifx) were obtained by using formula (14), and by taking 
the necessary values of fx) from the tables. 

In a second paper “Berechnung der Trendlinie auf Grund der 
Theorie der kleinsten Quadrate’’-" the determination of the coeffi- 
cients c3^ has been much simplified. These were obtained by 
multiplying the binomial moments by certain numbers, and were 
easily calculated with the aid of a table of bindmial coefficients 
( y/ . These tables exhibit the values given for Af up to 55 and 

“‘'Mitteilungen der Ungarischen LandeakommlMlon fur Wirtschaftsstatistik 
und Konjunkturforschung. Budapest, 1930. 
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for \) up to ten, and are sufficient for parabolas up to the tenth 
degree. Although the calculation of was not necessary for 
the determination of , nevertheless it had to be evaluated 
for the determination of the mean square deviation. For this 
purpose a very simple formula was given for (p. 45). 

In this paper the method of approximation by orthogonal 
polynomials has been freed from the tables gi\'ing the values of 
these polynomials corresponding to the given » values, These 
tables would be very voluminous if we wanted them extended 
up to one hundred observations and to parabolas up to the tenth 
degree. 

This has been attained by giving formulae which permit us to 
pass easily from the orthogonal expansion of the approximating 
parabola to its NnAon series, which gives directly the required 
values by means of the method of addition of the differences and 
by calculating the coefficients without the evaluation of 

r.yC)„M. 

The third paper ‘‘Sur la determination de la tendance seculaire 
des grandeurs statistiques par la methode des moindres carres.’*” 
(1930) published somewhat later than the second, differs from 
it inasmuch as it introduces the mean orthogonal moments, giving 
(p. S8S) a table for their calculation for parabolas up to the tenth 
degree, — the present table 1. The coefficients , the quantities 

figuring in the formula of the mean square deviation, 

and those of ^figuring in that of the different coeffi> 

TV ” 

cients of correlation, are expressed by these moments. The calcu*' 
lation of became needless, which is very fortunate since 

“^Journal de la Socidt^ Hongroise de Statistique, 1929, T. VII p. S67. 
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these minibers are very larf»e if /V is large, and it would there- 
fore lie difficult to oj^erate with such numbers. 'I'he table of the 
binomial coefficients has been extended sufficiently for one hun- 
dred obseivations f fV up to 105). 

Finally, in the prcsciil ixiper the orthogonal [xilynomials are 
left in the general form, the arbitrary constant remains entirely 
in the liackground, the coeffu'tent.s arc no longer calculated, 
the mean orthogonal moments alone are used, and by the aid of 
these quantities the Newton exiiansion of the approximating para- 
bola and also the mean square deviation and the coefficients of 
correlation are directly obtained. Ml unnecessary matter has been 
cleared away. 

Table II, giving numbers useful for graduation, and 

Table III. rendering it easier to establish Kcidoii'x formula, are 
new. Table IV. of the binomial coetikients has been e.xtended by 
a few lines (up to 110) in ordci to .suHicc for parabolic appioxi- 
mation of the tenth degree by the new formulae. 

§21. M. P. Pxshrr "Ueliei die Sterblichkeit in Schweden 
1886-1914”-* denotes in this paper the orthogonal ixiiynomial of 
degree 777 x- taking the values of 

Since the coefficient of \s taken as equal to unity, therefore 
the constant C of the polynomial according to formula 

(21'), C=77i!/(^^), Putting this value into (21'), and writing 
h-= i ,(} it will give the values of Bsshcr's polynomial 

corresponding to a given x , Formula (24) gives for the above 
value of C and h= 1 1 

r CP ^ ^ /Vv /77 j 

In a iccoiid paper "(Jn Graduation according to the Method of 
Least Squares by Means of Certain Polynomials’”-'", Bssher has 

““Medelanden (ran Lunds Astrononiiska Observatorium, Lund 1920, 

““Forsakringsaktiebolaget Skandia 1855-1930, Stockholm, 1930, p, 107, 
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tmployed other orthogonal polynomials, denoting them 
the variable taking the values of 1,2,3,.. ., A' . Adopting 
Lore It::’ s point of view (§22), he chose the constant C in such 
a manner that should be equal to /V . From our formula 

( 24) we conclude that in this case 



In this way the expression becomes very simple, but the 

polynomials themsehes become complicated. 

Putting h=l . as i , and the abore value of C into formula 
(21') we have 


/y 




m*l 

E 

\ls.O 




X* m 1 

/77 


The coefficient in an expression by Essher's polynomials 
Is expre.ssed \ery simply by our method. Indeed from (32) we 
get, if we put in this equation /? = / and the value or C above, 


■'rn 



« 


w here is the mean oi thogonal moment of degree /n and C , 


rn 


m 


C 


rno 


taken from 


the number given by formula (34), or 
Table HI. 

As a comparison let us determine the graduated value corre- 
sponding to Age 52, Essher’s example 10. (p, 116) 

The first column below contains the given values corresponding 


10 X in an inveited order of magnitude; the other columns are 
obtained by the method of § 9. The graduation for the central 
value will be obtained by an eleven-point parabola of the second 
degree. 
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f.7-1 

674 

674 

674 

873 

1547 

2221 

2895 

1005 

2552 

4773 

7668 

1216 

3768 

8541 

16209 

1331 

5099 

13640 

29849 

1239 

6338 

19978 

49827 

1640 

7978 

27956 

77783 

1385 

9363 

37319 

115102 

1366 

10729 

48048 

163150 

1315 

12044 

60092 


851 

12895 




l,ct 11 =; remark, that these numbers fifjure in llsslicr's table (p. 
117). We shall have 


T; » 6-, = 1269^/11 = 1172 , 272727 . . , 

7 = 60092/55 « 1092 , 501818 . . . 
165190/165 = 983 , 7373 ... 

Hence 

” 27 ^- 5 / .^ 7^. -127 896969 ... 

h'roni Table U, we take 5;j =-i, 9250 7691, and finally the 
required gnuhiated value will be 

1418,22 


agreeing with iij.\7(cr’,s- result. 

§ 22, P. Lorciits in the first edition of his paper “Der Trend"*' 
introduced orthogonal polynomials, distinguishing two cases ac- 
cording as the number of observations was either even or odd. 
He denoted polynomials of degree m chose them 

so that should be equal to /V. 

If 777 is odd the variable takes the values 


'“Vierteljahresliefte zur Koiijunkturforschung, Sonderheft 9, Berlin, 1928, 
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and the value of C in (x) corresponding to the above con- 
dition, taken from (24), is 





Hence is given by formula (21') by putting in it the above 
value of C and placing so that we have 



If m is even, the variable in takes the values 

-rN-3i . • . /, 5, • • •, r/v-y; 

and the value of C corresponding to the condition -/ywill 
be obtained from (24) by putting h=2 , so that 




The polynomial is given by (2T) by. placing in it this value of 
C and h‘2, a=r-('n-l). Hence it follows that 
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Whether h/ l}e odd or even, the uielficient of is tjiven l)y 
our tormula (32), the sumc formula appealing in Bsshcr's expan- 
bion, i.c., 


<3 


m 




m 



llere6J„ib the mean orthogonal moment of degree m ,and(f;^is 
given by formula (34), or since C^r} = 

The paper contains five decimal tab!e,s giving corre- 

sponding to the necessaiy integer values of X up to 5 and 
N up to 60. There are also other tahle.s ii.seful for the tian.sfor- 
mation of the orthogonal series into a power series, and also a 
table enabling one to change the interval of one year into that of 
one month. 

The second edition of the pajxir does not differ in principle 
from the I’lrst. The polynomials remain the same, but the tables 
{arX„ fx) have been extended for m up to six, and for N up 
to eighty, 

The only advantage that Lorentss method possesses over ours 
is that when applying Chetverikoff’s method to the determination 
of binomial moments the calculation in his system is a little easier, 
since the numbers to be added contain one or two figures less. 
Hut as this operation is generally made by calculating machines, 
this is but a slight advantage, and this is largely compensated for 
in the subsequent operations. 

As an example, let us determinate the coefficients corresponding 
to Lorents’s polynomials in the orthogonal expansion of the ex- 
ample in § 13. There, the mean orthogonal moments found were 

&o ' 99^2. 3333 ©3 = iS3 

Q, = 9 7i, 33 3 3 0,^“ 35^ 666 

9 ^^ 36 , 5-33 5 
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We have seen that C^=J] the other numbers | are 

taken from Table II., their square roots being 

^/C, ^ i, 46^ 6^0 109 v/Q = ^36 306 2J0 

\ICg = 0,91 Z 670 9'?9 Q, 462 9JO 050 

JC^ = O, 194 503 350 


Finally, the required coefficients are given by our formula 




a„ ■- 9942, 3333 3 


f 42 1 38641 


< 3 , « 51 , 49Z33 


a.= 162 , 79003 


5 ^' -177 57459 

in accordance with Lorentz's results. 

The corresponding mean square deviations would be given by 

gZ 


~J- T'\/^ -ei ^ 
h! - ^ 0-^1 


a 


m ' 


C4iaA^ 'JatcTa.^ 




TABLE III 


These numbers are given to ten figures, although generally fewer 
will be sufficient, especially if as is generally the case the mean 
orthogonal moments are all of the same order of magnitude. In 
this event, a fixed number of dccmls, properly chosen will suffice. 

The numbers given are checked by the relation: 
ml 


Remark. 


lim C and lifU i^ 

/Y-M 
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/vj 


^,1 

Qo 


3 - 

■ 

1,5 

1.5 

0,5 

-1.5 

4 - 

•1.8 

1,2 

1 

-2 

5 ■ 

-2 

1 

1.42H 571 429 

-2,142 857 143 

6 ■ 

-2,142 857 143 

0,857 142 857 1 

1.785 714 286 

-2.142 8.57 14.3 

7 ■ 

-2,25 

0,75 

2,08,1 333 333 

-2,083 333 333 

B 

-2.33,3 333 333 

(),(i66 (Kill <-)66 7 

2..3.3.1 3.33 333 

“2 

9 

-2,4 

0,6 

2„545 4.34 543 

- 1 ,'M)9 (»0 '109 

10 

-2,454 545 455 

0.545 454 545 5 

2,727 272 727 

-1.818 181 818 

11 

-2.5 

0,5 

2.8S4 615 .38.3 

-1.7.10 769 2.11 

12 

-2,538 461 5.18 

0,461 5,18 401 5 

3 021 978 021 

-1,648 ,351 648 

13 

-2,571 428 ,571 

0,428 571 428 6 

3.142 8.37 1-13 

-1.571 428 571 

14 

-2,6 

0,4 

3.2.3 

-1.5 

IS 

-2,625 

0,37.1 

3..3-15 5HH 235 

-1,43.3 823 .529 

16 

-2,647 058 823 

0..152 041 176 4 

3.431 372 540 

-1,.372 .540 020 

17 

-2,66(1 666 667 

(l,.1.13 3.33 333 3 

.3, .508 771 9.«l 

-1..315 789 474 

18 

-2,684 310 526 

(1,315 789 473 6 

3.S78 947 368 

- 1,26,3 1,57 m 

19 

-2,7 

0,3 

.3.642 8.37 143 

-1,214 285 714 

20 

-2,714 285 714 

0,285 714 285 7 j 

3,701 298 702 

-1.168 831 169 

21 

-2,727 271 727 

0,272 727 272 7 I 

.3,7.34 940 71 1 

- 1,12(1 482 213 

2J 

-2,7.19 130 436 

0,20(1 8f>9 56S 2 

3,804 347 82(i 

-1,086 '1,5(1 522 

23 

-2,75 

0,25 


-1,05 

24 

-2,7(1 

0.24 

3.892 307 h<)2 

-1,015 384 615 

25 

-2,769 2,10 769 

0,2,10 709 2,10 8 

3,931 023 933 

-0,'i82 905 982 0 

26 

-2,777 777 778 

0,222 222 222 2 

3,9(.8 253 9(,9 

-0,'152 380 952 4 

27 

-2,785 714 286 

0.214 285 714 2 

4,002 463 05.3 

-0,923 645 320 5 

28 

-2,793 103 447 

0,2()(> 806 SSI 0 

4,034 482 758 

-0,896 551 723 9 

20 

-2,8 

0,2 

4.ll(.4 516 128 

-0,870 967 741 8 

30 

-2,806 451 614 

0,193 548 387 1 

4.092 74 1 935 

-0,846 774 193 4 

31 

-2,8125 

0,1875 

4,119 318 182 

-0,823 863 636 4 

32 

-2,818 181 818 

0,181 818 181 8 

4,144 385 026 

-0,802 1.39 037 4 

33 

-2,823 529 411 

0,176 470 588 2 

4,168 067 225 

-0.781 512 60S 1 

34 

-2,828 571 429 

0,171 428 571 4 

4,190 476 192 

-0,761 'X)4 761 9 

35 

'2,833 333 333 

0,166 666 666 7 

4,211 711 712 

-0,743 24.3 243 3 

36 

-2,837 837 838 

0,162 162 162 2 

4,231 863 442 

-0,725 462 304 4 

37 

-2,842 105 263 

0,157 894 736 8 

4,251 012 146 

-0,708 502 024 3 

38 

-2,846 153 846 

0,153 846 153 8 

4,269 230 769 

-0,092 307 692 3 

39 

-2,85 

0,15 

4,286 585 369 

-0,676 829 268 3 

40 

-2,853 658 537 

0,146 341 463 4 

4,303 135 888 

-0,662 020 90S 9 

41 

-2,857 142 857 

0,142 857 142 9 

4.318 936 877 

-0,647 840 531 S 

42 

-2,8C>0 465 116 

0,139 534 883 7 

4,334 038 OSS 

-0,634 249 471 4 

43 

-2,803 036 304 

0,136 363 636 4 

4,348 484 848 

-0,621 212 121 2 

44 

-2,866 666 667 

0,133 333 .333 3 

4,.362 318 840 

-0,608 695 652 1 

45 

-2,869 565 217 

0,130 434 782 6 

4,375 578 168 

-0,5% 669 750 2 

46 

1 -2,872 340 425 

0,127 659 574 6 

4,388 297 872 

- 0,585 106 382 9 

4/ 

’ -2,875 

0,125 

4,400 SIO 204 

- 0,573 979 591 8 

48 

1 -2,877 SSI 021 

0,122 448 979 6 

4,412 244 898 

- 0,563 265 306 1 

49 

1 -2,88 

0,12 

4,423 529 411 

- 0,532 941 176 4 

5C 

1 -2,882 352 941 

0,117 647 058 8 

4,434 389 140 

-0.,542 986 425 4 
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N 

^10 

^it 



51 - 

2,884 615 385 

0,115 384 615 4 

4,444 847 606 

-0,533 381 712 7 

52 - 

2,886 792 453 

0,113 207 547 2 

4,454 926 625 

-0,524 109 014 7 

S3 

2,888 888 889 

0,111 in 111 1 

4,464 646 465 

-0,515 151 SIS 2 

54 

2,890 909 091 

0,109 090 909 1 

4,474 025 974 

-0,506 493 506 5 

55 

2,892 857 143 

0,107 142 857 1 

4,483 082 706 

-0,498 120 300 7 

56 

-2,894 736 842 

0,105 263 157 9 

4,491 833 031 

-0.490 018 148 9 

57 

-2,896 551 724 

0,103 448 275 9 

4,500 292 227 

-0,482 174 167 2 

58 

-2,898 305 085 

0,101 604 9IS 3 

4,508 474 576 

-0,474 576 271 2 

56 

■2,9 

0,1 

4,516 393 442 

-0,467 21.1 114 7 

60 

-2.901 639 344 

0,098 360 655 74 

4,524 061 343 

-0,460 074 034 9 

61 

-2,903 225 806 

0,096 774 193 54 

4,531 490 014 

-0,453 149 001 5 

62 

-2,9(14 761 90S 

0,095 218 095 24 

4.538 690 476 

-0,446 428 571 4 

63 

-2,906 250 

0,093 750 

4,545 673 077 

-0,439 903 846 2 

64 

-2,907 692 307 

0,092 307 692 30 

4.552 447 552 

-0.435 566 433 6 

65 

-2,909 090 909 

0,090 909 090 91 

4,559 02,1 067 

-0,427 408 412 5 

66 

-2,910 447 761 

0,089 552 238 80 

4.565 408 253 

-0,421 422 .100 3 

67 

-2,911 764 706 

0,088 235 294 12 

4,571 ()11 2.54 

-0,415 601 023 1 

68 

-2,913 043 478 

0,086 956 521 74 

4,577 639 752 

-0,409 937 888 2 

69 

-2,914 285 714 

0,085 714 285 72 

4.583 SOI 007 

- 0,404 426 5.59 4 

70 

-2,915 492 960 

0,084 507 042 26 

4,589 201 879 

- 0,399 061 0,12 9 

71 

-2,916 666 667 

0,083 333 333 33 

4.594 748 858 

- 0,.39,1 815 610 4 

72 

-2,917 808 219 

0,082 191 780 82 

4,600 148 094 

- 0„188 744 900 4 

73 

-2,918 918 919 

0,081 081 081 08 

4,605 405 405 

- 0,38,1 78,1 783 8 

74 

•■2,92 

0,08 

4,610 526 316 

- 0,378 947 368 4 

75 

-2,921 052 632 

0,078 947 368 42 

4,615 516 062 

- 0,374 231 0,12 1 

76 

-2,922 077 922 

0,077 922 077 92 

4,620 379 620 

- 0,369 630 360 6 

77 

-2,923 076 923 

0,076 923 076 92 

4,625 121 713 

- 0,365 141 187 9 

78 

-2,924 050 633 

0,075 949 307 08 

4,629 746 835 

- 0,360 759 493 7 

79 

-2,925 

0,075 

4,634 259 260 

- 0,356 481 481 S 

8C 

-2,925 925 926 

0,074 074 074 07 

4,638 663 052 

- 0,352 303 523 0 

81 

-2,026 829 268 

0,073 1 70 731 70 

4.642 962 091 

-0,348 222 156 8 

82 

-2,927 710 843 

0,072 289 156 62 

4,647 160 068 

- 0,344 234 079 1 

8. 

-2,928 571 428 

0,071 428 571 42 

4,651 260 504 

- 0,340 336 1,14 4 

8^ 

-2,929 411 765 

0,070 588 235 30 

4,655 26b 758 

- 0,336 525 307 3 

8- 

-2,930 232 558 

0,069 767 441 86 

4.659 182 037 

- 0,3.12 798 716 9 

8( 

-2,931 034 483 

0,068 965 517 24 

4,()63 009 403 

- 0,329 153 604 9 

8' 

-2,931 818 182 

0,068 181 818 18 

4,666 751 789 

- 0,325 587 334 1 

8 

-2,932 584 270 

0.067 415 730 34 

4,670 41 1 986 

- 0,322 097 378 4 

8 

-2,933 333 333 

0,066 666 666 67 

4,673 992 674 

-0,318 681 318 7 

■ 

3 -2,934 065 934 

0,065 934 065 93 

4,677 496 416 

-0,335 336 837 0 

9 

1 -2,934 782 608 

0,065 217 391 30 

4,680 925 664 

-0,312 061 711 0 

9 

2 -2,935 483 871 

0,064 516 129 04 

4,684 282 773 

- 0,308 853 809 2 

■ 

3 -2,936 170 213 

0,063 829 787 24 

4,687 569 990 

l-Q,.iOS 711 086 3 

9 

4 -2,936 842 105 

0,063 157 894 74 

4,690 789 473 

-0,302 6.11 578 9 

9 

5 -2,937 S 

0,062 5 

4,693 943 301 

-0,299 613 402 2 

9 

6 -2,938 144 330 

0,061 855 670 10 

4,697 033 453 

- 0,296 654 744 5 


7 -2,938 775 510 

0,061 224 489 80 

4,700 061 841 

-0,293 75.1 865 1 


8 -2,939 393 939 

0,060 606 060 61 

4,703 030 303 

-0,290 909 090 9 


9 -2,94 

0,06 

4,705 940 594 

-0,288 118 811 9 

1C 

zs 

0 -2,940 594 060 

0,059 405 940 59 

4,708 794 409 

-0,285 381 479 3 
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3 
2 

1.428 ; 
1,071 -i 
0,833 ; 
, 0,666 ( 
0,545 ^ 
' 1,454 : 
1,384 ( 
0,329 ■ 
0,285 
[4 0,25 

15 0,220 

16 0,196 

17 0,175 

18 0,157 

19 0,142 

20 0,129 

21 0,118 
22 0,108 

23 0,1 

24 0,092 
0,085 


571 429 
428 572 
333 333 3 
666 666 7 
454 545 5 
545 454 5 
615 384 6 
670 329 7 
714 285 7 

588 235 2 
078 431 4 
: 438 596 S 
' 894 736 8 
1 857 142 9 
I 870 129 9 
! 577 075 1 

1 695 652 2 

2 307 692 28 
5 470 085 50 




ISXIJ 


-2 

-2.5 

-2.5 

-2„333 .333 3,33 
- 2,121 212 121 
-1,909 000 909 
-1,713 286 714 
-l.-W 461 .5,38 
-1„384 615 ,384 
-1,25 

-1,13 2 3,52 041 
-1,029 411 765 
-0,0,30 112 487 0 
-0,8.50 649 122 8 
-0.789 47.3 684 2 
-0,727 272 727 3 
-0.671 936 758 9 
-0,622 529 644 .3 
-0,578 260 869 6 
-n,.5,38 47.1 .5.38 4 
-0,.502 564 102 5 

- 0,470 085 471) 1 
-0.440 61,3 026 8 
-0,41.3 793 103 4 

- 0.389 ,321 468 2 

- 0,366 935 484 0 
-0,346 407 624 S 
-0,327 540 106 9 
-0,310 160 427 9 
-0,294 117 647 1 
-0,279 279 279 ,3 
-0,265 528 686 6 
-0,252 762 884 4 
-0,240 890 688 2 
-0,229 83! 144 5 
-0,219 512 1 95 1 
-0.209 869 540 6 
-0,200 845 666 0 
-0,192 .389 006 4 
-0,184 453 227 9 
-0,176 996 608 1 
-0,169 981 498 6 
-0,163 ,373 860 2 
-0,157 142 857 1 
-0,151 260 504 2 
-0,145 701 357 4 
-0,140 442 243 7 
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N 



C4.1 


4 

5 

4 

2..5 

0,071 428 571 43 

-0„357 142 857 2 

1,071 428 .571 

6 

l,r)7ift 606 (i67 

0,214 285 714 3 

'0,857 142 857 2 

1,928 571 429 

7 

1,106 06fi 667 

0,409 09(1 'XW 1 

- 1,363 636 .364 

2,454 .545 455 

R 

0,848 484 848 5 

0,6,36 .3(1,3 6.36 4 

-1,818 181 818 

2,727 272 727 

I) 

0,6.36 .36.3 6,36 4 

O.RRl 118 881 1 

-2,202 797 20.3 

2,8.32 167 8.32 

10 

0,489 510 489 6 

1,1,32 867 1.33 

-2,517 482 517 

2,8.32 167 832 

n 

0„384 615 .384 6 

1„384 615 .385 

-2,760 230 769 

2.769 230 769 

12 

0..307 692 .307 6 

1.631 868 1.32 

-2,067 032 967 

2,670 .329 670 

12 

0,25 

1.871 848 7.30 

-3,119 747 809 

2,552 521 008 

14 

0,205 882 .35,3 0 

2,102 941 177 

-. 3 , 2.35 294 118 

2.426 470 589 

15 

0,171 568 627 5 

2,.324 303 405 

-.3,.320 43.1 436 

2,298 761 610 

Ui 

0,144 478 844 2 

2,5.35 60.3 715 

-3.380 804 95.3 

2,17.3 .374 613 

17 

0,122 807 017 5 

2.736 842 104 

-3,421 052 ().30 

2,052 6.31 578 

18 

0,105 26.3 1,57 9 

2,928 229 666 

-3.444 976 077 

1,937 790 043 

I'J 

0,01)0 909 090 91 

.1,110 107 28.3 

-.3,455 674 759 

1,829 474 872 

20 

0,079 051 ,383 40 

,3.282 891 022 

—3, *155 76(1 

1,727 837 380 

21 

0,069 169 960 48 

.3,447 0.35 57.3 

-. 3,447 035 57.3 

1,632 806 324 

21 

0.060 8(i9 565 22 

.3,(i(1.3 out 0,3,3 

-,3.431 4,38 127 

1,544 147 1 57 

22 

0,(].53 846 15,3 84 

,3,751 282 051 ’ 

-.3.410 2.56 410 

1,461 538 461 

24 

0,047 8(.,3 247 8(i 

.3,802 ,307 69.3 

-.3.,384 61.5 .38,5 

1,.3R4 615 .385 

25 

0,042 7,15 042 74 

4,02(1 525 199 

-3,355 437 666 

1,312 997 347 

2ft 

0,0,38 .314 17ft 24 

4,1.54 .351 ,39(1 

-.3,.323 4H1 1 17 

1,246 .305 419 

27 

0,0.34 482 758 62 

4.276 179 88.3 

-3,289 .369 141 

1,184 172 891 

2S 

0.0.31 14.5 717 40 

4„3y2 380 423 

-.3.253 615 128 

1,126 251 391 

29 

0,028 225 80() 4(i 

4,50.3 2'W 121 

-3,216 642 229 

1,072 214 076 

,30 

0.025 659 824 04 

4,609 259 101 

-.3,178 799 .380 

1,021 756 943 

,31 

0,023 ,393 721 92 

4,710 561 498 

-3,140 374 3.32 

0,974 598 930 5 

.32 

0,021 .390 374 ,34 

4,807 486 6.32 

-3,101 604 278 

0,9.30 481 283 5' 

.3.3 

0,010 607 84,3 14 

4,9(XI 295 25,3 

-3.0(.2 684 5,3.3 

0,889 166 477 4 

.34 

0,018 018 018 02 

4,989 229 831 

- 3,023 775 655 

0,850 436 903 0 

35 

0,016 595 542 91 

5,074 515,814 

-2,985 009 302 

0,814 093 446 0 

30 

0,015 318 962 69 

5,156 362 841 

-2,946 493 052 

0,779 954 043 1 

.37 

0,014 170 040 49 

5,2.34 965 933 

-2,908 314 407 

0,747 852 276 1 

,38 

0,013 133 208 26 

5,310 506 567 

-2,870 544 090 

0,717 636 022 S 

.39 

0,012 195 121 95 

5,383 1.53 716 

-2,833 238 798 

0,689 166 194 0 

41) 

0,011 344 299 49 

5,453 064 8(13 

-2,796 443 488 

0,662 315 563 1 

41 

0,010 570 824 62 

5,520 386 590 

-2,760 193 29S 

0,636 967 683 S 

42 

0,009 866 102 890 

5,585 255 997 

-2,724 515 121 

0,613 OIS 902 2 

43 

0,009 222 661 396 

5,647 800 858 

-2,689 428 980 

0,590 362 459 0 

44 

0,008 633 980 882 

5,708 140 610 

-2,654 949 121 

0,568 917 668 8 

45 

1 0,008 094 .357 076 

5,766 386 943 

-2,621 084 974 

0,548 599 180 6 

4( 

) 0,007 598 784 1 94 

5,822 644 377 

-2,587 841 945 

0,529 331 307 0 

4/ 

' 0,007 142 857 142 

5,877 010 804 

-2,555 222 089 

0,511 044 417 7 

41 

! 0,006 722 689 076 

5,929 577 985 

-2,523 224 675 

0,493 674 392 9 

4! 

) 0,006 334 841 628 

5,980 431 999 

-2,491 846 666 

0,477 162 127 6 

SO 0,005 976 265 688 

6,029 653 662 

-2,461 083 127 

0,461 453 086 4 
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IS 

r 

'-33 

^4.0 

^41 


SI 

0,005 644 2S0 928 

6,077 318 907 

-2.430 927 563 

0,446 496 899 3 

52 

0,005 336 382 694 

6,123 499 143 

-2,401 372 213 

0,432 246 998 3 

S3 

0,005 050 SOS OSl 

6,168 261 563 

-2,372 408 293 

0,418 660 287 1 

S') 

0,004 784 688 996 

6,211 669 457 

-2,344 026 210 

0,405 696 844 1 

SS 

0,004 S37 20S 082 

6,253 782 470 

-2,316 215 7.30 

0,393 319 652 2 

56 

0,004 306 499 738 

6,294 656 864 

-2,288 966 132 

0,381 494 3SS 4 

57 

0,004 091 174 752 

6,334 345 746 

-2,262 266 338 

0,370 189 037 1 

58 

0,003 889 969 436 

6,372 899 282 

-2,236 105 on 

0,359 374 019 7 

59 

0,003 701 745 108 

6,410 364 887 

-2,210 470 651 

0,349 021 681 7 

60 

0,003 525 471 S32 

6,446 787 416 

-2,185 351 666 

0,339 106 293 1 

61 

0,003 360 215 054 

6,482 209 321 

-2,160 736 440 

0,329 603 863 8 

62 

0,003 205 128 206 

6,516 670 830 

-2,1,36 613 .387 

0,320 492 008 1 

63 

0,003 059 440 560 

6,550 210 049 

-2,112 970 984 

0,311 749 817 2 

64 

0,002 922 450 684 

6,582 863 143 

-2,089 797 823 

0,303 357 748 5 

65 

0,002 793 519 036 

6,614 664 415 

-2.0b7 082 630 

0.295 297 518 5 

66 

0,002 672 061 686 

6,645 646 448 

-2,044 814 292 

0,287 552 009 8 

67 

0,002 557 544 758 

6,675 840 208 

-2,022 981 881 

0,280 105 183 5 

68 

0,002 449 479 48(i 

0.705 275 129 

-2,001 574 665 

0,272 941 999 8 

69 

0,002 347 417 840 

6,733 979 218 

-1,980 582 123 

0,266 048 344 9 

70 

0,002 250 948 614 

6,761 979 132 , 

- 1,959 993 951 . 

0,259 410 964 1 

71 

U.002 150 093 940 

6,789 300 259 

-1,939 800 074 

0,253 017 401 0 

72 

0,002 07,-) 300 182 

6,815 966 796 

-1,919 990 647 

0,246 855 940 3 

73 

0,001 991 465 149 

6,842 001 810 

-1,900 556 0.58 

0,240 915 556 7 

74 

0,001 913 875 598 

6,867 427 310 

-1,881 485 934 

0,235 185 K66 8 

75 

0,001 840 264 998 

6,892 264 298 

-1,862 774 135 

0,229 657 085 1 

7t, j 

0,001 770 381 517 

6,916 532 835 

-1,844 408 756 

0,224 319 983 8 

77 \ 

0.00 1 703 992 210 

6,940 252 082 

-1,826 382 127 

0,219 165 855 2 

78 

0,001 640 881 388 

6,963 440 362 

-1,808 685 808 

0,214 186 477 3 

79 

0,001 580 849 142 

6,986 IIS 193 

-1,791 311 588 

0,209 374 OSl 7 

80 

0,001 523 710 016 

7,008 293 336 

-1,774 251 477 

0,204 721 324 3 

SI 

0,001 469 291 801 

7,029 990 839 

-1,757 497 710 

0,200 221 258 1 

82 

0,00! 417 434 443 , 

7,051 223 066 

-1,741 042 732 

0,195 867 307 4 

S3 

0,001 367 989 056 

7,072 004 744 

-1,724 879 206 

0,191 653 245 1 

84 

0,001 320 817 020 

7,092 349 982 

-1,708 999 996 

0,187 573 170 2 

85 

0,001 275 789 167 

7,112 272 313 

-1,693 398 170 

0,183 621 488 3 

SO 

0,001 232 785 038 

7,131 784 720 

- 1,678 066 993 

0,179 792 892 1 

87 

0,001 191 692 203 

7,150 899 663 

-1,662 999 932 

0,176 082 344 6 

88 

0,001 152 405 647 

7.169 629 101 

-1,648 190 598 

0,172 485 062 6 

89 

0,001 114 827 202 

7,187 984 527 

-1,633 632 847 

0,168 996 501 4 

90 

0,001 078 865 034 

7,205 976 979 

-1,619 320 669 

0,165 612 341 2 

91 

0,001 044 433 171 

7,223 617 068 

-1,605 248 237 

0,162 ,328 473 4 

92 

0,001 011 451 071 

7,240 915 001 

-1,591 409 890 

0,159 140 989 0 

93 

0,000 979 843 226 0 

7,257 880 596 

-1,577 800 129 

0,156 046 166 7 

94 

0,000 949 538 795 4 

7,274 523 294 

-1,564 413 612 

0,153 040 462 0 

95 

0,000 920 471 281 2 

7,290 852 189 

-1,551 245 147 

0,150 120 498 1 

96 

0,000 892 578 212 2 

7,306 876 040 

-1,538 289 69.3 

0,147 283 055 7 

97 

0,000 865 800 805 8 

7,322 603 281 

-1,525 542 350 

0,144 525 064 8 

98 

0,000 840 084 008 4 

7,338 042 040 

-1.512 998 359 

0,141 843 596 1 

99 

0,000 815 375 655 2 

7,353 200 151 

-1,500 653 092 

0,139 235 853 9 

100 

0,000 791 626 849 8 

7,368 085 172 

-1,488 502 055 

0,136 699 168 3 
-i — — — 
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seseps 

N 

^43 




5- 

-2,S 

5 



6 ■ 

-1 

1 

- 0.02,1 809 521 81 

0,142 .8.57 142 9 

7 

-2.861 616 164 

1,009 090 909 

- 0.08.3 111 111 11 

0,416 666 666 7 

8- 

-2,.S4.4 4S4 545 

1,272 727 271 

-0.179 487 179 5 

0.769 2-30 769 2 

9 

-2,202 797 201 

0,881 118 8.81 1 

- 0.107 (i'>2 107 7 

1,153 846 154 

1(1 

-1,888 111 8HH 

0,629 170 629 4 

- 0,461 518 461 5 

1,538 461 5.38 

11 

-1,615 .184 615 

0,461 5.18 461 5 

- 0,634 615 384 5 

1,003 846 154 

12 

-1,384 615 ,185 

0,.146 151 846 2 

- 0,821 266 968 5 

2.239 819 005 

13 

-1,191 176 47! 

0,264 70.5 882 4 

-1.016 8116 723 

2,542 016 807 

14 

-1,029 411 765 

0,205 882 151 0 

- 1,217 402 260 

2.809 597 523 

IS 

-0,891 962 848 2 

0,162 5.18 699 7 

- 1.420 407 637 

3,043 7.10 650 

Ifi 

-0,780 185 758 4 

0,110 010 959 7 

-1,623 .12,1 013 

3,246 646 027 

17! 

-0,684 210 526 1 

0,105 263 157 9 

- 1,824 561 40.1 

3,421 052 6.11 

18! 

-0,602 870 811 5 

0,086 124 401 91 

- 2,022 88.1 295 

1,-569 7W 051 

19 

-0,511 596 817 7 

0,071 146 245 01 

- 2.2)7 ,19) .104 

,1,695 6.52 171 

20| 

-0,474 108 10(1 4 

0,050 288 517 SS 

- 2,407 4.5.3 416 

,1,80 1 24 2 2,36 

21 

-0,421 120 1.58 1 

(),()4'J 802 371 54 

-2, .592 642 141 

,3,888 961 211 

22 

-0,179 264 214 0 

0,042 MO 468 22 

- 2,772 68(1 734 

,1,9(.t) 981' 048 

2,1 

-0,141 025 641 0 

0,035 897 435 90 

- 2,947 435 897 

4,019 211) 768 

24 

-0„107 692 307 7 

0,0.10 769 230 77 

-.3,110 828 765 

4,065 428 824 

2S 

-0,278 514 588 8 

0.026 525 198 94 

-.1,280 872 .184 

4,101 (190 480 

2(1 

-0,2.52 873 563 3 

0,022 988 505 75 

- 3.4.19 624 274 

4,127 549 129 

27 

-0,210 255 819 9 

(1,020 022 246 94 

- 3,59.1 178 929 

4.145 975 087 

28 

-0,210 231 592 9 

0.017 519 4r)f) 08 

-.1,741 657 397 

4,157 ,197 lOH 

29 

-0,192 448 680 4 

0,015 395 894 43 

-.1,885 199 241 

4,162 713 473 

1C 

1 -0,176 599 965 5 

0,013 584 612 73 

- 4,02.1 956 357 

4,162 713 472 

11 

-0,162 431 155 1 

0,012 0.12 085 S6 

- 4,158 088 215 

4,158 088 2,15 

12 

I -0,149 732 620 1 

0,010 695 187 17 

- 4,287 758 346 

4,149 44.1 .561 

11 

-0,118 114 785 4 

0,009 5.18 950 715 

-4,413 131 .198 

4,137 .31(1 686 

14 

-0,128 022 759 6 

0,008 ,514 850 639 

-4,5.14 371 270 

4,122 155 70(1 

1,S 

-0,118 721 960 9 

0,007 659 481 .147 

-4,651 639 494 

4,104 .387 789 

16 

1 -0,110 296 511 4 

0,000 891 533 210 

-4,765 094 116 

4,084 366 385 

17 

' -0,102 646 190 8 

0,006 220 991 .184 

- 4,874 888 909 

4,062 407 424 

18 

-0,095 684 801 00 

0,005 628 517 824 

-4,981 172 826 

4,0,38 788 778 

19 

-0,089 136 158 49 

0,005 104 9,34 771 

- 5,084 089 620 

4,013 754 961 

4C 

1 -0,081 535 296 24 

0,004 040 849 791 

-5,183 777 052 

.3,987 521 270 

41 

, -0,078 224 101 48 

0,004 228 329 810 

. -5,280 .169 782 

.3,960 277 316 

42 

1 -0,071 152 330 17 

0,00.1 860 648 956 

-5,373 99,1 360 

3,912 190 261 

42 

1 -0,068 875 620 22 

0,003 5-32 083 088 

- S,4M 770 274 

3,903 407 339 

4^ 

( -0,064 754 856 61 

0,00-3 2.17 742 831 

-5,552 817 057 

3,874 058 412 

4! 

i -0,060 955 464 51 

0,002 973 4.17 29.1 

- 5,638 245 on 

3,844 257 962 

4( 

5 -0,057 446 808 51 

0,002 715 562 110 

-5,721 160 378 

3,814 106 919 

4: 

I -0,054 201 680 67 

0,002 521 008 40,1 

- 5,801 664 512 

3,783 694 247 

4i 

i -0,051 195 862 96 

0,002 327 (184 o80 

- 5,879 854 060 

3,753 098 336 

41 

5 -0,048 407 752 07 

0,002 151 455 648 

- 5,955 821 167 

3,722 388 229 

m 

0 -0,045 818 016 94 

0,0(11 992 088 563 

- 6,029 653 660 

3,691 624 690 
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51 -0,043 409 420 76 

52 -0,041 106 380 79 

53 -0,039 074 960 13 

54 -0,037 122 587 04 

55 -0,035 297 917 50 

56 -0,033 590 607 96 

57 -0,031 991 645 18 

58 -0,030 492 341 06 

59 -0,029 085 140 14 

60 -0,027 763 088 32 

61 -0,026 519 851 11 
62-0,025 349 650 35 

63 -0,024 247 208 01 

64 -0,023 207 696 61 

65 -0,022 226 694 94 

66 -0,021 300 148 87 

67 -0,020 424 336 30 

68 -0,019 595 835 88 

69 -0,018 811 499 13 

70 -0,018 068 425 36 

71 -0,017 363 939 28 
72-0,016 695 570 84 

73 -0,016 061 037 11 

74 -0,015 458 225 99 

75 -0,014 885 181 44 

76 -0,014 340 090 29 

77 -0,013 821 270 IS 

78 -0,013 ,327 158 59 
70 -0,012 856 30,5 26 

80 -0,012 407 352 99 

81 -0,0U 979 049 63 

82 -0,011 570 220 69 

83 -0,011 179 772 63 

84 -0.010 806 684 71 

85 -0,010 450 003 40 

86 -0,010 108 837 31 

87 - 0,009 782 352 480 
8S -0,009 469 768 142 
89 -0,009 170 352 790 
W-O.OD8 88.5 420 fiOD 

91 -0,008 008 328 137 

92 -0,008 .54 1 471 .535 
9,1-0,008 091 282 715 
94 -0,007 848 228 ,S21 
'.3 -0,007 614 807 87,5 
9(1 -O.OO;' .!90 .547 597 

y;-u,uo; 17.5 00.5 21 s 

98-0,O0i) 907 7.^5 600 

99 -0,00(1 7o 8 409 5i,4 

100 -11 Olio .57(1 .592 290 


0,001 847 209 394 
0,001 715 265 866 
0,001 594 8% 332 
0,001 484 903 482 
0,001 384 232 059 
0,(X)1 291 949 922 
0,001 207 231 894 
0,001 129 345 965 
0,001 057 641 460 
0,000 991 538 868 6 
0,000 930 521 091 5 
0,000 874 125 874 3 
0,000 821 939 254 5 
0,000 773 .589 887 0 
0,000 728 744 096 5 
0,000 687 101 S76 5 
0,000 648 391 628 6 
0,000 612 369 871 4 
0,000 S78 815 357 9 
0,000 547 528 041 3 
0,000 518 326 545 7 
0,000 491 046 201 2 
(1,000 465 S37 307 6 
0,000 441 66.5 599 6 
0,000 419 300 885 6 
0,000 308 335 841 8 
0,0(10 378 664 935 6 
0.000 360 193 475 3 
O.OOO 342 834 75.5 7 
0,00() 326 509 289 2 | 
0,000 311 144 146 2 
0,000 296 672 325 4 
0,000 283 032 218 5 ' 
0,000 270 167 117 6 
0,000 258 024 775 3 
0,000 246 557 007 5 
0,000 235 719 336 9 
0,000 225 470 670 0 
0,000 215 773 006 8 
0,000 201) 591 176 8 
D.UOO 197 892 600 8 
0,000 189 647 075 8 
(1,(K)0 181 826 577 9 
0,000 174 405 084 9 
0,000 107 358 414 8 
0,11(1(1 ICO 604 078 2 
O.OOO 1.54 .101 14 4 4 
0,000 148 250 119 2 
0,00(1 142 492 832 9 
O.OOO 1.17 (112 3,59 4 


-6,101 435 253 
-6,171 245 726 
-6,239 161 120 
-6,305 253 929 
-6.369 593 254 
-6,432 244 994 
-6.49.5 271 986 
-6,552 734 183 
-6,610 688 791 
-6.667 590 404 
-6,722 291 149 
-6,776 040 811 
-6,828 486 947 
-6,879 675 007 
-6,929 648 433 
-6,978 448 774 
-7,026 115 774 
-7,072 687 465 
-7,118 200 254 
-7,162 689 006 
-7,206 187 117 
-7,248 726 592 
-7,290 338 111 
-7,331 051 094 
-7,370 893 763 
-7,409 893 200 
-7,448 075 405 
-7,485 465 343 
-7,522 086 99,5 
-7.557 963 401 
-7.593 116 719 
-7,627 568 248 
-7,661 338 474 
-7,694 447 108 
-7.726 913 130 
-7,758 754 806 
-7,789 989 730 
-7,820 6,54 849 
-7,850 706 50.5 
-7,880 220 438 
-7,909 191 835 
-7.937 635 345 
-7,965 565 098 
-7,992 994 727 
-8,019 937 409 
-8,046 405 84() 
-8,072 412 329 
-8,097 968 722 
-8,123 086 494 
-8,147 776 722 


3,660 861 152 
3,630 144 544 
3,599 516 031 
3,569 011 658 
3,538 662 919 
3,508 497 269 
3,478 538 504 
3,448 807 465 
3,419 321 788 
3,390 096 815 
.5,36! 145 575 
3,332 479 087 
3,304 106 587 
3,276 035 717 
3,248 272 703 
3,220 822 511 
3,193 688 988 
3,166 874 984 
3,140 .582 465 
3,114 212 611 
3,088 365 ‘X)7 
3,062 842 222 
3,037 640 880 
3,012 760 724 
2,988 200 174 
2,96,5 957 280 
2,940 029 765 
2,916 415 068 
2,893 110 382 
2,870 112 684 
2,847 418 770 
2,825 025 277 
2,802 928 710 
2,781 125 461 
2,759 611 832 
2,738 384 049 
2,717 438 278 
2,696 770 638 
2,675 377 217 
2,656 254 080 
2,6.56 397 278 
2,616 802 861 
2,597 466 879 
2,578 385 396 
2,559 554 492 
2,540 970 267 
2,522 628 853 
2,504 526 409 
2,486 659 131 
2,469 023 249 




APPROX !^^ATIOX BY ORrilOaOX.l I. PORYXO Ml. f 


6 

3.5 

2.153 S46 ; 
1,384 615 ; 
0,923 076 ! 
0.634 615 : 
0,447 963 i 
0,.323 529 . 
0,238 390 ( 
0.178 792 
0,136 222 
0.105 263 
0,082 379 
0,065 217 
0,052 173 
0,042 1 40 
0,034 ,33(1 
0,028 205 
0,023 342 
0,019 451 
0.016 314 
0,013 765 
0,011 679 
0,009 962 
0,008 538 
0,007 352 
0,006 359 
0,005 522 
0,004 814 
0,004 212 
0,003 698 
0,003 2S8 
0,002 879 
0,002 552 
0,002 268 
0,()()2 022 
0,001 mi7 
0,001 618 
0,001 453 
(1,001 .308 
0,001 180 
0,001 06(1 
0,(XH) 965 
0,000 87(1 
O.iKK) 79o 


154 
385 
923 1 
384 5 
801 0 
411 7 
092 9 
569 7 
910 2 
157 9 
862 71 
391 29 
913 05 
468 24 
I 677 82 
128 20 
: 175 07 
812 55 
423 43 
294 77 
644 05 
040 336 
899 431 
941 176 
300 476 
; SSO 415 
531 130 
714 739 
, 969 040 
; 615 581 
( 706 793 
I 467 385 
1 859 898 
! 244 692 
' 112 278 
! 871 415 
1 680 455 

4 312 409 
) (146 487 
I 580 478 

5 9,59 678 4 
) 518 967 5 

6 8.15 424 9 
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0,000 725 689 405 0 
0,000 662 032 439 7 
0,000 604 960 677 5 
0,000 553 692 823 S 
0,000 507 551 754 8 
0,000 465 949 152 1 
0,000 428 372 607 6 
0,000 394 374 781 6 
0,000 363 564 251 8 
0,000 335 597 770 9 
0,000 310 173 697 3 
0,000 287 026 406 4 
0,000 265 921 523 S 
0,000 246 551 848 0 
0,000 229 033 858 8 
0,000 212 904 713 9 
0,000 198 119 664 3 
0,000 184 549 824 3 
0,000 172 080 241 5 
0,000 160 608 225 4 
0,000 ISO 041 894 8 
0,600 140 298 914 6 
0,000 131 305 394 5 
0,000 122 994 926 5 
0,000 US 307 743 5 
0,000 108 189 981 6 
0,000 101 593 031 5 
0,000 095 472 969 38 
0,000 089 790 054 S3 
0,000 084 508 286 61 








APPROXIMATION BY ORTHOGONALPORYNOMIALS 


BiaL&sa. 

N 


^61 

^62 

^63 

7 

0,007 57S 757 S76' 

-0,053 030 .103 0.1 


- 0,6,16 .16,1 6.16 4 

8 

0,030 303 030 303' 

-0,181 818 181 8 

0,606 060 606 1 

- 1,454 545 455 

9 

0,072 727 272 73 

-0,381 818 181 8 

1,090 <8)9 091 

- 2,181 818 182 

10 

0,136 363 636 4 

-0,636 36.1 636 4 

1,5<90 '709 001 

- 2,727 272 727 

11 

0,220 588 235 3 

- 0,926 470 588 2 

2,058 823 529 

- 3,088 2.15 204 

12 

0,323 529 411 7 

- 1,235 294 118 

2,470 .588 2.15 

- 3,2<J4 117 647 

13 

0,442 724 458 2 

- 1,549 535 604 

2,817 3.17 461 

- 3,380 804 95.1 

14 

0,575 541 795 6 

- 1,859 442 724 

3,099 071 207 

- .1,.180 804 9.M 

15 

0,719 427 244 5 

-2,158 281 734 

3,320 4.13 4.16 

- 3,320 43,1 4.i6 

to 

0,872 033 023 9 

- 2,441 692 467 

3,488 132 095 

- 3,219 814 242 

17 

1,031 273 836 

- 2,707 093 821 

3.609 458 427 

- 3,09.1 821 .509 

18 

1,105 340 129 

- 2,953 193 260 

3,691 491 .575 

- 2,953 19,1 2(,0 

19 

1,362 687 747 

-3,179 604 74,1 

3,740 711 462 

- 2,803 5.1.1 5‘;7 

20 

1,532 1)15 810 

- 3,386 561 265 

3,762 845 850 

- 2,656 126 482 

21 

1,702 239 789 

- 3,574 703 5,57 

3,762 845 830 

- 2,508 .5(i3 ‘8)1) 

22 

1,872 463 768 

-3,744 927 537 

3,744 927 5.17 

- 2,365 217 .192 

23 

2,041 954 023 

-3,898 275 863 

3,712 643 679 

- 2,227 586 2li7 

24 

2,210 114 943 

-4,035 862 069 

.1,61)8 905 518 

- 2,096 55 1 724 

2S 

2,376 467 681 

-4,158 818 441 

3,616 363 8()2 

- 1,<272 562 106 

26 

2,540 631 565 

-4,268 261 030 

.3,556 884 191 

- 1,H5S 765 6(,5 

27 

2,702 308 120 

- 4,365 266 963 

.1,4<J2 213 570 

- 1,746 106 78S 

28 

2,801 267 421 

-4,450 860 433 

3,423 738 795 

- 1,643 394 621 

20 

3,017 336 553 

-4,526 004 830 

3,352 596 170 

- 1,547 352 079 

«30 

3,170 .189 857 

-4,591 599 103 

3,279 713 645 

- 1,457 650 SW 

31 

3,320 340 729 

-4,f)48 477 020 

3,205 846 220 

- 1.373 934 095 

32 

3,467 134 739 

-4,697 408 357 

3,131 60S 571 

- 1,295 8.16 788 

Oil 

3,610 743 870 

-4,739 101 330 

3,057 484 729 

- 1,222 993 892 

34 

3,751 101 688 

-4,774 205 785 

2,983 878 615 

- I.ISS 049 787 

35 

3.888 399 310 

-4,803 316 795 

2,911 101 088 

- 1,091 662 908 

.56 

4,022 482 045 

-4,826 978 454 

2,839 399 091 

- 1,032 508 760 2 

37 

4,153 446 577 

-4,845 687 673 

2,768 ‘J64 385 

- (1,977 281 547 6 

38 

4,281 338 627 

-4,859 897 901 

2,699 943 279 

- 0,925 694 838 4 

30 

4,406 211 004 

-4,870 022 689 

2,632 444 697 

- 0,877 481 S6S S 

40 

4,528 121 979 

-4,876 439 055 

2,566 546 871 

- 0,832 393 579 7 

41 

4,647 133 947 

-4,879 490 64S 

2,502 302 89S 

- 0,790 200 914 1 

42 

4.763 312 297 

-4,879 490 645 

2,439 745 323 

- 0,750 690 868 S 

43 

4,876 724 494 

- 4,876 724 494 

2,378 889 997 

- 0,713 666 999 1 

44 

4,987 439 320 

-4,871 4S2 359 

2,319 739 218 

- 0,678 948 063 9 

45 

5,095 .526 240 

-4,803 911 411 

2,262 284 377 

- 0,646 366 964 9 

46 

5,201 054 890 

-4,854 317 898 

2,206 308 135 

-0,615 769 712 2 

47 

5,304 094 656 

-4,842 869 034 

2,152 386 237 

- 0,587 014 428 4 

48 

5,404 714 339 

-4,829 744 728 

2,099 889 012 

- 0,559 970 403 3 

49 

5,502 981 872 

-4,815 109 138 

2,(H8 982 612 

- 0,534 S17 203 1 

SO 

1 5,598 964 115 

-4,799 112 098 

1,999 630 041 

- 0,510 543 840 2 
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rv 

^60 

C6i 




51 

5,692 726 671 

- 4,781 890 403 

52 

5,784 333 767 

- 4,763 568 985 

53 

5,873 848 142 

- 4,744 261 961 

54 

5,961 330 988 

- 4,724 073 614 

55 

6,046 841 882 

- 4,703 099 242 

S6 

6,130 438 777 

- 4,081 425 075 

57 

6,212 177 960 

- 4,659 133 470 

58 

6,292 114 073 

- 4,6,36 294 580 

59 

6, .370 300 108 

- 4,612 975 940 

60 

6,446 787 415 

- 4,589 2.38 499 

61 

6, .521 625 741 

- 4,505 138 018 

62 

6,594 863 253 

- 4,540 725 519 

63 

6,666 546 5.50 

- 4,516 047 66.3 

64 

6,7.36 720 721 

- 4,491 147 147 

65 

6,805 429 383 

- 4,466 063 032 

66 

6,872 714 701 

- 4,440 831 038 

67 

6,938 617 445 

- 4,415 4K3 829 

68 

7,003 177 02,3 

- 4,390 051 268 

69 

7,066 431 525 

- 4,364 560 648 

70 

7,128 417 767 

- 4,330 036 901 

71 

7,189 1 71 .328 

- 4„313 502 797 

72 

7,248 726 59.3 

- 4,287 979 111 

73 

7,307 116 796 

- 4.262 484 798 

74 

7,364 .374 054 

- 4,237 037 127 

75 

7,420 529 411 

- 4,211 651 828 

76 

7.475 612 874 

- 4,186 343 210 

77 

7,520 653 449 

- 4,161 124 27S 

78 

7,582 679 179 

- 4,136 006 825 

79 

7,6.34 717 172 

- 4,111 001 554 

80 

7,68.5 793 649 

- 4.086 118 142 

81 

7,735 9.33 963 

- 4,061 365 331 

82 

7,785 162 634 

- 4,036 750 995 

8.3 

7,833 503 382 

- 4,012 282 220 

84 

7,88(1 979 159 

- 3,987 965 357 

85 

7,927 612 174 

- 3,963 806 087 

86 

7,073 423 909 

- 3,939 809 461 

87 

8,018 4,35 176 

- 3,915 979 969 

88 

8,062 666 103 

- 3,892 321 567 

89 

8.106 136 191 

- 3,868 837 728 

90 

8,148 864 315 

- 3,845 531 475 

91 

8,1W 808 771 

- 3,822 405 427 

92 

8,232 167 271 

- 3,799 461 817 

93 

8,272 770 972 

- 3,776 702 531 

94 

8,312 714 510 

- 3,754 129 136 

95 

8,351 996 021 

- 3,731 742 903 

96 

8,390 637 115 

- 3,709 544 830 

97 

8,428 652 946 

- 3,(S87 535 664 

98 

8,466 058 210 

- 3,665 715 926 

99 

8,502 867 159 

- 3,644 085 925 

100 

8,539 093 617 

- 3,622 645 777 


'^<SJ 

1,951 792 001 - 0,487 948 000 3 

1,905 427 594 - 0,466 635 329 1 

1,860 494 887 - 0,446 518 772 8 

1,816 951 390 - 0,427 517 974 1 

1,774 754 431 -■ 0,40 9 558 714 8 

1,733 861 472 - 0,392 572 408 8 

1,694 230 353 - 0,376 495 634 0 

1,655 819 493 - 0.361 269 707 6 

1,618 588 049 - 0,346 840 296 2 

1,582 496 034 - 0,333 157 059 8 

1,547 504 413 - 0,320 173 .326 8 

1,513 575 173 - 0,307 845 797 9 

1,480 671 365 - 0,296 134 273 0 

1,448 757 144 - 0,285 001 405 4 

1,417 797 788 - 0,274 412 475 1 

1,38 7 759 699 - 0,264 335 180 8 

1,3 5 8 610 409 - 0,254 7.39 451 7 

1,330 318 566 - 0,24 5 597 273 7 

1,302 853 925 - 0,236 882 531 8 

1,276 187 324 - 0.228 570 864 0 

1,250 290 666 - 0,220 639 529 2 

1,225 136 889 - 0,213 067 285 0 

1,200 699 943 - 0,205 834 275 9 

1,176 954 757 - 0,198 921 9,30 8 

1,153 877 213 - 0,192 312 868 9 

1,131 444 111 - 0,185 990 812 7 

1,109 633 140 - 0,179 940 509 2 

1,088 422 849 - 0,174 147 655 8 

1,067 792 612 - 0,168 598 833 4 

1,047 722 601 - 0,163 281 444 2 

1,028 193 755 - 0,158 183 654 6 

1,009 187 749 - 0,153 294 341 5 

0,990 686 967 9 - 0,148 603 045 2 

0,972 674 477 4 - 0,144 099 922 6 

0.9SS 133 996 9 - 0,139 775 706 9 

0,938 049 871 7 - 0,135 621 668 2 

0,921 407 051 6 - 0,131 629 578 8 

0,905 191 062 1 - 0,127 791 679 4 

0,889 387 983 4 - 0,124 100 648 8 

0,873 984 426 1 “ 0,120 549 576 0 

0,858 967 511 6 - 0,117 131 9,33 4 

0,844 324 848 3 - 0,113 841 552 6 

0,830 044 512 3 - 0,110 672 601 6 

0,816 US 029 6 - 0,107 619 564 3 

0,802 525 355 5 - 0,104 677 220 3 

0,789 264 857 4 - 0,101 840 626 8 

0,776 323 297 6 - 0,099 105 101 83 

0,763 690 817 9 - 0,096 466 208 58 

0,751 357 922 8 - 0,093 919 740 34 

0,739 315 464 7 - 0,091 461 706 97 


^6^ 






MR APPROXIMATION PY ORTHOGONAL POLYNOMIA IJ 


N 

^64 

^65 

^66 


7 

1,590 909 091 • 

- 3,5 

7,0 


8 

2,727 272 727 

-4,0 

4.0 

~ 0,002 331 002 331 

9 

3,272 727 273 

-3,6 

2.4 

- 0.010 489 510 49 

10 

3,409 090 909 

- 3,0 

1.5 

- 0,027 766 351 30 

11 

3,308 823 529 

- 2,426 470 588 

0.970 588 235 2 

- 0,056 561 085 97 

12 

3,088 235 294 

- 1,941 176 470 

0,647 058 823 5 

- 0,098 237 675 65 

13 

2,817 337 461 

- 1,549 535 604 

0,442. 724 458 2 

- 0,15.1 250 774 0 

14 

2,535 603 715 

- 1,239 628 483 

0,309 907 120 7 

- 0,221 362 229 1 

15 

2,263 931 888 

- 0,996 130 030 9 

0,221 362 229 1 

- 0.301 857 585 2 

16 

2,012 383 901 

- 0,804 953 560 5 

0,160 990 712 1 

- 0,393 727 285 0 

17 

1,784 897 025 

- 0,654 462 242 3 

0,118 993 135 0 

- 0,495 804 729 2 

18 

1,582 067 818 

- 0,535 469 107 6 

0,089 244 851 26 

- 0,606 864 988 6 

19 

1,402 766 798 

- 0,440 869 565 2 

0,067 826 086 95 

- 0,725 09 1 699 5 

20 

1,245 059 289 

0,365 217 391 3 

0,052 173 913 05 

- 0,851 119 894 6 

21 

1,106 719 368 

- 0,304 347 826 1 

O.IMO 579 710 14 

- 0,982 061 416 9 

22 

0,985 507 246 5 

- 0,255 072 403 8 

0,031 8,84 057 98 

- 1,117 518 164 

23 

0,879 310 344 9 

- 0,214 942 528 8 

0.025 287 356 33 

- 1,256 587 091 

24 

0,786 206 896 7 

- 0,182 068 905 5 

0,020 229 885 06 

- I,.iy8 459 827 

25 

0,704 486 466 6 

- 0,154 987 022 6 

0,016 314 423 44 

- 1,542 418 927 

20 

0,632 047 385 8 

- 0,132 554 690 4 

0,013 255 469 04 

- 1,087 832 159 

27 

0,569 382 647 3 

- 0,113 876 529 S 

0,010 845 383 76 

- 1,834 145 783 

28 

0,513 560 819 2 

- 0,098 246 417 59 

0,008 931 492 508 

- 1,980 877 446 

29 

0,464 205 623 6 

- 0,085 104 364 32 

0,007 400 379 507 

- 2,127 609 108 

30 

0,420 476 108 3 

- 0,074 003 795 07 

0.000 166 982 922 

- 2,27,1 980 25(1 

31 

0,381 648 359 6 

- 0,064 586 645 47 

0,005 166 931 638 

- 2,419 681 502 

32 

0,347 099 139 7 

- 0,056 564 304 24 

0,004 3S1 100 326 

- 2,564 448 772 

33 

0,316 291 523 7 

- 0,049 702 953 72 

0,003 681 700 276 

- 2,708 037 '.)03 

34 

0,288 762 446 6 

- 0,043 812 233 28 

0,003 129 445 235 

- 2,850 319 857 

35 

0,264 111 993 9 

- 0,038 736 425 77 

0,002 671 477 639 

- 2, <191 070 393 

36 

0,241 994 240 7 

- 0,034 347 569 64 

0,002 289 837 976 

- 3,130 190 225 

37 

0,222 109 442 6 

- 0,030 540 048 36 

0,001 970 325 701 

- 3,207 571 608 

38 

0,204 197 390 8 

- 0,027 226 318 78 

0,001 701 644 923 

- 3,403 US 319 

39 

0,188 031 764 0 

- 0,024 333 522 41 

0,001 474 758 934 

- 3,536 757 996 

40 

0,173 415 329 1 

- 0,021 800 784 23 

0,001 282 3<J9 072 

- 3,668 44S 794 

41 

0,160 1 75 861 0 

- 0,019 577 049 67 

0,001 118 688 553 

- 3,798 138 322 

42 

, 0,148 162 671 4 

- 0,017 619 344 71 

0,000 978 852 483 8 

- 3,925 800 837 

43 

0,137 243 653 7 

- 0,015 891 370 42 

O.IXX) 858 992 995 9 

- 4,051 432 057 

44 

■ 0,127 302 762 0 

- 0,014 362 362 89 

0,000 755 913 836 4 

- 4,175 005 704 

45 

: 0,118 237 859 4 

- 0,013 006 164 54 

0,000 666 982 796 8 

- 4,290 523 005 

4^ 

1 0,109 958 877 2 

- 0,011 800 464 87 

0,000 590 023 243 4 

- 4,415 990 001 

47 

’ 0,102 386 237 S 

- 0,010 726 177 26 

0,000 523 228 159 1 

-4,533 414 237 

4S 

1 0,095 449 500 S( 

5 - 0,009 766 925 638 

0,000 465 091 697 1 

- 4,648 810 235 

49 

• 0,089 086 200 51 

1 - 0,008 908 620 052 

0,000 414 354 421 0 

-4,762 195 851 

SC 

) 0,083 240 843 5; 

2 - 0,008 139 104 699 

0.000 369 959 304 5 

- 4,873 S92 245 
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fy ^64 ^6'5 ^66 ^70 

sT 0,077 864 042 61 - 0,007 447 864 945 0,000 331 016 219 8 - 4,983 023 346 

52 0.072 911 770 17 - 0,006 825 782 740 0,000 296 773 162 6 - 5,090 515 375 

53 0,068 344 710 13 - 0,006 264 931 761 0,000 266 502 840 9 - 5,196 096 435 

54 0,064 127 696 11 -0,005 758 405 365 0,000 239 933 556 9 - 5,299 795 149 

55 0,060 229 222 77 -0,005 3fK) 171 603 0,000 216 333 534 8 - 5,401 645 355 

56 0,056 621 020 50 - 0,004 884 950 788 0.000 195 398 031 5 - 5,501 675 824 

57 0.053 277 084 05 -(),(K)4 508 111 727 0.000 176 788 695 2 - 5,599 920 035 

58 0,0.50 176.34B 27 - 0,004 165 583 6.50 0,000 160 214 755 0 - 5,696 410 966 

59 0,047 296 404 03 - 0,003 853 781 069 0,000 145 425 700 7 - 5,791 181 915 

60 0,044 619 249 08 - 0,00.1 569 539 927 0.000 132 205 182 5 - 5,884 266 354 

61 0,042 128 069 32 -0,003 310 062 .589 0,000 120 365 912 3 - 5,975 697 797 

62 0,039 807 646 28 - 0,003 072 870 941 0,000 109 745 390 7 - 6,065 509 681 

63 0,0.57 644 187 24 -0,002 855 765 029 0,000 lOO 202 31.1 3 - 6,1.53 735 274 

64 0,035 625 175 68 - 0,002 656 792 763 O.OOO 091 613 543 54 - 6,240 407 602 

65 0,0,53 739 238 74 -0,002 474 210 841 0,000 083 871 55,5 94 - 6,325 S59 363 

66 0.0,51 976 029 94 -0,002 306 4(.7 733 O.OOO 076 882 257 78 - 6,409 222 878 

67 0.0,5U-,526 125 20 -0.002 1,52 176 627 0.(100 070 56,1 168 10 - 6,491 430 043 

68 0,028 780-930 52 -0,002 OK) 096 735 0,000 064 841 8,50 15 - 6,572 212 283 

69 0,027 ,532 599 82 -0,001 879 116 238 0,000 059 (»54 48.5 73 - 6,051 600 525 

70 0,025 973 961 82 -0,001 758 237 415 0.000 054 944 919 23 -6,729 625 164 

71 0,024 608 454 77 - 0,001 646 563 651 0,000 050 663 496 95 - 6,806 316 049 

72 0,023 50(1 068 20 -0,001 543 288 OOl 0,000 046 766 ,504 88 - 6,881 702 461 

73 0,022 373 290 86 -0,001 447 683 526 O.OOO 043 214 43,5 63 -6,955 813 102 

74 0,021 313 064 02 - 0,001 3.59 09.3 937 0,000 039 973 351 10 - 7,028 676 091 

75 0,020 314 739 67 - 0,001 276 926 493 0,000 037 012 362 13 - 7,100 318 949 

76 0,019 374 042 99 -0,001 200 644 918 0,000 034 304 140 51 -7,170 768 60S 

77 0,018 487 038 61 -0,001 129 763 471 0,000 031 824 .323 12 - 7,240 051 394 

78 0,017 650 100 25 -0,001 ()0.3 841 659 0,000 029 551 157 19 -7,308 193 054 

79 0,016 859 883 34 -0,001 002 479 550 0,000 027 465 19,3 15 -7,375 218 737 

80 0,016 113 300 42 -0,000 945 313 024 o' 0,000 025 549 016 88 - 7,441 153 002 

81 0,015 407 498 82 - 0,000 892 013 089 6 0,000 023 787 015 72 - 7,506 019 842 

82 0,014 739 840 54 - 0,000 842 276 602 2 0,000 022 165 173 74 - 7,569 842 663 

83 0,014 107 884 04 - 0,000 795 829 355 9 0,000 020 670 892 36 - 7,632 644 320 

84 0,013 509 367 74 -0,000 752 420 481 8 0,000 019 292 832 87 - 7,694 447 109 

85 0,012 942 195 08 -0,000 711 820 729 4 0.000 018 020 777 96 - 7,735 272 780 

86 0,012 404 420 87 -0,000 673 820 393 0 0,000 1)16 845 509 83 - 7,8lS 142 541 

87 0,011 894 239 05 -0,000 638 227 461 1 0,000 015 758 702 74 - 7,874 077 094 

88 0,011 409 971 37 -0,000 o04 865 952 2 0,000 014 752 828, 10 - 7,932 096 611 

89 0,010 950 057 25 -0,000 573 574 427 4 0,000 013 821 070 54 -7,989 220 756 

90 0,010 513 044 42 -0,000 544 204 652 3 0,000 012 957 253 63 - 8,045 468 731 

91 0,010 097 580 47 -0,001) 516 620 395 9 0,000 012 155 774 02 - 8,100 859 228 

92 0,009 702 405 050 -0,000 490 696 347 4 0,000 011 411 S42 96 -8,155 410 463 

93 0,009 320 342 835 -0,000 466 317 141 7 0.000 010 719 934 29 -8,209 140 228 

94 0,008 968 297 029 -0,000 443 376 482 3 0,000 010 076 738 23 -8,262 065 838 

95 0,008 627 243 429 -0,000 421 776 345 4 0,000 009 478 120 122 - 8,314 204 185 

96 0,008 302 22S 008 -0,000 401 426 264 1 0,000 008 920 583 647 - 8,365 571 731 

97 0,007 992 346 922 - 0,000 382 242 678 9 0,000 008 400 937 997 - 8,416 184 525 

98 0,007 696 771 961 -0,000 364 148 350 9 0,000 007 916 268 497 - 8,466 058 210 

99 0,007 414 716 343 -0,000 347 071 828 8 0,000 007 463 910 297 - 8,515 208 040 

100 0,007 14S 445 857 -0,000 330 946 966 0 0,000 007 041 424 809|-8,563 648 883 
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8 0,018 648 018 65 - 0,083 916 083 92 0,279 720 279 7 - 0,769 230 769 2 

9 0,073 426 573 42 ■ 0,283 216 783 2 0,786 713 286 6 - 1.730 769 231 

10 0,172 768 408 1 - 0,583 093 377 2 1,388 317 SOS - 2,545 248 869 

11 0,316 742 081 4 - 0,950 226 244 3 1.979 638 009 - 3,110 859 728 

12 0,500 119 076 0 ' 1,3S0 321 SOS 2,500 593 380 - 3.438 318 648 

13 0,715 170 278 6 - 1,755 417 957 2.925 606 594 - 3,575 851 393 

14 0,953 560 371 5 - 2,145 510 836 3,250 773 994 - 3,575 851 393 

15 1,207 430 341 - 2,507 739 938 3.482 972 136 - 3,482 972 136 

16 1,469 915 197 - 2,834 836 452 3,6.14 405 707 - 3,331 5,18 .565 

17 1,735 316 552 - 3,123 569 794 3.718 535 469 - 3,146 453 089 

18 1,999 084 068 - 3,373 455 .178 .1,748 283 75.1 - 2,945 080 092 

19 2,257 707 510 - 3,585 770 731 3,7.15 177 865 -2,739 1.10 4.14 

20 2,508 563 900 - 3,762 845 850 3.689 064 559 - 2,5.16 231 884 

21 2,749 771 907 - 3,907 570 690 3,618 121 010 - 2,341 137 124 

22 2,980 048 437 - 4,023 065 390 3.529 004 729 - 2,156 614 001 

23 3,198 585 323 - 4,112 466 84.1 3,427 055 703 - 1.9R4 084 881 

24 3,404 945 666 - 4,178 796 954 3,316 505 519 - 1,824 078 035 

25 3,598 977 496 - 4,224 886 626 3,200 671 687 - 1,676 542 312 

26 3,780 744 036 - 4,253 337 041 3,082 128 290 - 1..S41 004 145 

27 3,950 407 841 - 4,266 505 208 2.962 850 880 - 1,417 015 638 

28 4,108 486 555 - 4,266 505 268 2,844 336 845 - 1,30,1 654 ,188 

29 4,255 218 217 - 4,2.5.5 218 217 2,727 703 983 - 1,200 189 753 

30 4,391 134 276 ■ 4,2.14 308 0.52 2,613 770 40.1 - 1,105 825 940 

31 4,516 738 804 - 4,205 239 577 2,303 118 796 - 1,019 789 139 

32 4,632 552 620 - 4,169 297 358 2,396 147 907 - 0,941 34.1 820 6 

33 4,7.19 101 330 - 4,127 004 .184 2,293 113 547 - 0,869 801 690 2 

34 4,836 90o 423 - 4,081 139 795 2,194 I6I 180 -0,804 525 766 0 

35 4,926 478 765 - 4,030 755 353 2,099 351 746 - 0,744 931 264 8 

36 5,008 313 960 - 3,977 190 498 2,008 682 070 - 0,690 484 461 4 

37 5,082 889 168 - 3,921 085 9.10 1,922 100 946 - 0,640 700 315 3 

38 5,150 661 023 - .1,862 995 7«7 1,839 521 794 - 0,595 139 404 0 

39 5,212 064 416 - 3,803.198 337 1,760 8.12 57.1 -0,553 404 522 9 

40 5,267 511 909 -3,742 703 830 1,685 903 527 - 0,515 137 188 9 

41 5,317 393 651 - 3,681 272 528 1,614 593 214 - 0,480 014 198 8 

42 5,362 077 631 ' 3,619 402 401 1,546 753 163 - 0,447 744 336 6 

43 5,401 910 209 - 3,557 355 503 1,482 231 460 " 0,418 065 283 5 

44 5,437 216 809 - 3,495 3S3 663 1,420 875 473 " 0,390 740 755 0 

45 5,468 302 770 - 3,433 585 460 1,362 533 913 - 0,363 557 879 0 

46 5,495 454' 224 ■ 3,372 210 546 1,307 058 351 - 0,342 324 806 3 

47 5,518 939 071 - 3,311 363 442 1,254 304 334 - 0,320 868 550 6 

48 5,539 007 940 - 3,251 156 834 1,204 132 161 -0,301 033 040 2 

49 5,555 895 159 - 3,191 684 453 1,156 407 411 - 0,282 677 367 0 

50 5,569 819 708 - 3,133 023 586 1,111 001 272 - 0,265 674 217 1 
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The values of the binomial coefficients. 
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CONCERNING THE LIMITS OF A MEASURE 

OF SKEWNESS 

Hv KAYMON'I) r.AKVKIJ 

of .tl t,oi Atu r', > 

in a rccem note tn tlio Anllal^ ni Mathcm.nical StalMics,* 
Hotelling and SolnnioiK ‘le\is{cl an mtirniou.s itifilnn! of ^hI>\\il)g 
that the measure nf skewness s de-tinnl hy tin- equatictn 

Tvea n ■ medi d n 

S » - 

St<^r)Ctt}rd dovt &tion 


cannot he greatei titan unity in nlwolute value. 1 am venturing to 
offei another itroof of the laine fact, which seems to me to he of 
interest because it employs an important and well-known algebraic 
inequality. 

With Hotelling and Kolomons. 1 shall assume that we arc con- 
cerned with V readings, nr ;tr'5,with median aero ami mean x , 
where x of cour.se is T, x/n We may show that the nh.solute 
value of 3 cannot be greater than one by showing that 1/s^ is 
not less than one. Making obvious substitutions, we must then 
show that 


■nCx^ 

(Hx)^ 


^ I, 


Now according to a known theorem if ,k‘ixo.r\ positive 

numbers, and if m is a number not lying between zero and one, 
then 




rn 


^ ai-b+'"-t- H ) 


m 
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While the proof of this theorem is given in Chrystal. we shall 
outline a (simplified) proof for the case to make this note 

self-contained. For any number r we obviously have (r-l)^^O , 
Vow let r equal //f ■ /V" 

The first of these gives 

r)^<s^ ■Sna ^ i o 

(’S’i- • • • 4- k J ^ (a^b* • -t-kj ■' 

while the others give .similar inequalities. Summing the.se inequal- 
ities we have 


■ 4-k^) 


( a+b^ 


k)^ 


- 2 n4n = O , 


which i.s Chrystnl’.s theorem.' for 7t?=Z- The proof shows that 
some of the numbers k can be zero; in fact, some can 

he negative, provided a + b^- ■ • -y- /r is not zero. 

Now, suppose we have an odd number of readings, say 
Since the median reading is zero, there are s non-negative read- 
ings, which we shall now call y's, and s non-positive readings, 
which we shall call %'s, We have at once, by the above. 


esz)^ = ’ 


It follows immediately that 




Z1 


♦Chrystal, Algebra, Part II, 2ncl ed., 1922, p. 49, 



m LIMm OF A MEASURE OF SKEWNESS 

und, since nic J that 


n 


27 a ^ ) 


>z. 


Finally, 


since 2(Ly)Cn»)\^, certainly not positive. 

This proof is valid unless all the y's are zero or all the z's are 
zero. vSupposc the latter is the ca-se. Then 


» Pr'‘ >2 

(ex)^ rLyJ^ ^ 



> 2 , 


If all the readings are zero our definition of s does not give a 
definite value, 

If n is even, not odd, the proof may be modified by properly 
defining the median. In this case we can show again that 

r)E ? 

rsxj^ = 


but the possibility of the equality cannot be ruled out . 





EDITORIAL. 


Tfapezoidal Rule for Computing Seasonal Indices. 

The following method for computing seasonak is suggested by 
the Detroit Edison article on "A Mathematical Theory of Sea- 
sonals” that aj)peared in Vol. 1, No. 1 of the Amah. 

We shall likewise define “the seasonal index for any month as 
the ratio of the total of the variate.s for the month in question to 
the total that would have been expcrieiiced if neither accidental 
nor seasonal influence.s were pres'ent". that is. the seasonal index 
for the /-M month is 


(1) 



The numerator presents no difficultie.s : the obstacle i.s met in 
determining the denominator, since the unknown function 

that is the comsequence of only trend an<i cycle influences. Accord- 
ing to accepted concepts the trend may be represented by some 
smooth analytic function, the cycle is a smooth though not a 
mathematically periodic function — but the seasonal and residual 
influences may inject all sorts of disturbances into a time series. 
We shall make but two further assumptions,-— 

(a) The smooth function y-jTfx) . representing the combined 
eflfect of trend and cycle, may be approximated by the upper sides 
of a series of trapezoids as in figure (I), The area of each 
trapezoid is to equal the area under the function Wmltad by- 
the common ordinates. 

(b) Neither seasonal nor accidental influences affect annual 
totals, Thus we might assume that the seasonal activity in the 
production of coal does not affect the total coal mined within the 
year, but merely concentrates production within certain months 
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EDITORIAL 


and compensates this with a corresponding under-production in 
others. Although accidental disturbances within the year would 
merely attribute production to one month rather than the next, we 
must admit that if one of these months i.s the last of one year, and 
the other the first oi the following year, such an accidental fluctua- 
tion will affect the annual totals. Usually, however, such perturba- 
tions represent hut a small percent of the monthly production, and 
a negligible part of the annual total. 



to 




O 
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lyCt us assume that we are dealing with 2n consecutive years; 
then by the above assumptions the area of the v-th trapezoid 
equals the total production for the n-thyesiX, 7"^ . 

Designating the ordinates corresponding to the v-fh trapezoid 
by and , it follows since we are dealing with equal unit 
bases, 

that is 


, -so that 



= 

2T,- 

^0 




U2, » 

ZT,- 

u, = 

2T;3-2Tf i-tJo 




zr,- 

Uz- 




<-^ 4 - = 

1 

U3 = 

ZT^-ZTs^ZT^-ZJ, 

■hUg 





z“ ' ' 

-^ZT,-U, 




'-‘■Xn 


■■^ZT^-ZT,^a, 


By elementary geometry, the area corresponding to the twelve 
months of the 77 -Myear are 


M, 

1_ 

Z88 




^ZQB 

(Z1 6Z^_J 

, 4 . 3 u^) 

M, 

^208 


* 


i 

”268 

( 

*Zfu„J 

Mu 

f 

( ^r?-f * 

Z3iz„^ 


According to definition, E equals the sum of the values of 
one for each of the Zrt years Thus. 



EDITORIAL 


3fi4 


1* Ug 
* 23 Uj,-t-Ug 




that is 


< 3 ) 


r.yr^ 


‘Jsa 






and in precisely the same manner 

^z,?. r)^^C<^2n - "<> ^ 


But by (2) 

Uq -f u^ * * "••'^ ^xn-l “ ’‘•^' 5 ’* • 

= 2-0, 

where O designates the sum of the totals for the odd years. 
Again, 






<+27^-2Tf 


^ 2- E- 2-0, 


E representing the corresponding sum for the even years. 
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We have finally that 

Cljr, -:jT4 {^3 0 ^E) 

(4) . . 

5- f E-Oj. 

where d represents the common difiPerence , which 

from equation {V) is seen to he 

“iia “72 


It should be observed that we have not imposed the condition 
that the long time trend is a straight line — no matter what the 
law of growth may lx*, the assumption that the trend-cycle func- 
tion, may be approximated within each year by a secant 

line is alone responsible for the equal diflPerences 5 . 

As an illustration let us compute the seasonals for the theoretical 
series presented in the Detroit Edison article, and reproduced 
below. 

TABI,g 1 

THEOaniCAL SfiRIBS. 



1904 

1905 

1906 

1907 

1908 

1909 

Jan., 

906 

1662 

1908 

2030 


1714 

Feb. 

814 

1582 

1860 

1855 


1831 

Mar. 

1138 

1913 

2052 

2077 

1283 

1831 

Apr, 

1215 

1976 

2027 

2088 

1210 

2077 

May 

1343 

1892 

2122 

2043 

1203 

2143 

June 

1236 

1700 

1672 

2093 

1166 

2058 

July 

1254 

2092 

2041 

2060 

1240 

2320 

Aug. 

1702 

1757 

1846 

2163 

1334 

2413 

Sept. 

1457 

1906 

2102 

2262 

1279 

2502 

Oct, 

1564 

1899 

2304 

1946 

1364 

2643 

Nov. 

1596 

1611 

2303 

1475 

1341 

2378 

Dec. 

1836 

2163 

2170 

1153 

1564 

2595 

Total 

16061 

22153 

24407 

23245 

15278 

26S0S 















M nDirORlAL 

TaHUK I Coniinued 



1910 

1911 

1912 

1913 

1914 

1915 

Jan. 

2392 

I'J.ll 

2052 

2554 

2(Ml 

1687 

Feb. 

2514 

1740 

2(Kil 

2330 

1780 

1532 

Mar. 

i 2417 

1895 

2267 

2554 

2194 

1906 

Apr. ' 

28.10 ' 

1805 

24<10 

2800 

2037 

1796 

May 

2702 

21f.7 

2419 

2845 

2156 

2539 

June 

2475 

17.12 

2571 

2696 

1730 

2674 

July 

2211 

1742 

2657 

2314 

1577 

2566 

Aug. 

2249 

2011 

2469 

2525 

1649 

2661 

Sept. 

2108 

1976 

2591 

2377 

1652 

2952 

Oct. 

2203 

2144 

2516 

2850 

1753 

3342 

Nov. 

1875 

2168 

2389 

24'M 

1585 

3093 

Dec. 

1899 

2092 

2435 

2150 

1779 

3182 

Total 

27875 

23471 

28917 

30489 

219.13 

29930 


O- 134 471 

EWr ?o 

6 -- Z96. 14 

Column I of table 2 is obtained by adding the items of table 1 
horizontally. Column 2 is found by rei^ated adding the common 
difference, S = 296.14 to the value 22 559.90. 


Tabu; 2 

Sbasonals by Trai*k/.oii)AL Ri't.ii. 


Month 

Vj 

t :< p - 

5 

Jan. 

22 121 

22 560 

.981 

Feb. 

20 957 

22 856 

.917 

Mar. 

23 527 

23 152 

1,016 

Apr. 

24 411 

23 448 

1.041 

May 

25 574 

23 744 

1.077 

June 

23 803 

24 041 

.990 

July 

24 074 

24 337 

1.015 

Aug, 

24 779 

24 633 

1.006 

Sept. 

25 164 

24 929 


Oct, 

26 528 

25 225 


NoV.-.’’ 

24 308 

25 521 

,952 

Dec. ' 

25 018 

25 817 

.969 

*.1' 

Total’’ 

. 290 264 

290 263 

12,025 
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The following table presents the mean and standard errors for 
the results obtained by Link-relative method, the Interpolation 
method suggested in the Detroit Edison article, and the Trape- 
zoidal method. Obviously, the last lerjuires by far the least time 
in application. 


TvUu.i: .t 


Mcthiid 

M.D 


Unk Relative 

0277 

.0338 

Ifttcrpcilatkiii 

0269 

.0337 

'f rape/oitlal 

.0227 

.0255 


For weekly indices the fonnulae by the trapezoidal rule are 


^ ” TJF* C J 

If one has data for Zni-l year.s. he may either disregard the 
most distant year and tlicn compute seasonals for the hnal 
years, or he may combine the results for the first Zn years (neg- 
lecting the last year) and the results for the last 2n years (neg- 
lecting the first year). This yields an almost equally simple 
formula. 

Applying the Trapezoidal rule to successive overlapping periods 
will reveal the presence of a shifting seasonal. The change in the 
seasonal for Automobile production, caused by the advent of good 
roads and closed models, affords a good example. 



